INDEX
Explanations
specific endings of words, particularly "-ent"
Negative Logits
Marian      -0.15
Epstein     -0.15
uggage      -0.14
uppy        -0.14
othermal    -0.14
omens       -0.14
Por         -0.14
Ĩ           -0.14
mass        -0.13
itches      -0.13
Positive Logits
istrov      0.16
iros        0.16
dad         0.15
bart        0.15
yre         0.15
GRAT        0.14
eah         0.14
129         0.14
onis        0.14
bons        0.14
Activations Density 0.000%