INDEX
Explanations
the word "almost" in various contexts
New Auto-Interp
Negative Logits
ear
-0.17
eer
-0.16
uled
-0.16
orde
-0.15
g
-0.15
ear
-0.14
Jamal
-0.14
d
-0.14
ninger
-0.14
e
-0.14
POSITIVE LOGITS
lied
0.15
.infinity
0.15
estone
0.15
縮
0.14
prostituer
0.14
_axis
0.14
iglia
0.14
adio
0.14
exo
0.14
売
0.14
Activations Density 0.016%