INDEX
Explanations
the word "for" in various contexts
Negative Logits
Token       Logit
439         -0.15
Gors        -0.15
57          -0.15
39          -0.15
ru          -0.14
process     -0.14
65          -0.14
Kavanaugh   -0.14
xit         -0.14
CType       -0.14
Positive Logits
Token       Logit
acer        0.18
unately     0.16
kees        0.16
aml         0.15
ahl         0.15
aland       0.15
earn        0.15
bak         0.15
मक          0.15
kiye        0.14
Activations Density 0.511%