INDEX
Explanations
expressions of speech or quotes in the text
New Auto-Interp
Negative Logits
jar
-0.16
inks
-0.15
inky
-0.15
ÃŃna
-0.15
ph
-0.15
cope
-0.15
NES
-0.14
Jen
-0.14
imens
-0.14
itals
-0.14
POSITIVE LOGITS
Sne
0.17
Chambers
0.15
olucion
0.14
cầm
0.14
utomation
0.14
Pied
0.14
abama
0.13
ekce
0.13
agt
0.13
fout
0.13
Activations Density 0.031%