INDEX
Explanations
various forms of the word "accept" and its related expressions
New Auto-Interp
Negative Logits
859
-0.19
zÅij
-0.15
lá
-0.15
abouts
-0.14
lev
-0.14
omor
-0.14
urai
-0.14
ahl
-0.14
esis
-0.14
ASA
-0.14
POSITIVE LOGITS
offered
0.24
offer
0.23
responsibility
0.22
offers
0.20
offers
0.19
challenge
0.19
challenge
0.19
premise
0.18
Responsibility
0.18
offer
0.18
Activations Density 0.119%