Explanations
adverbs, particularly those ending in 'ly' and various forms of 'mente'
Negative Logits
ovky     -0.15
RIPT     -0.15
gens     -0.15
oin      -0.14
aged     -0.14
oved     -0.13
loit     -0.13
chatte   -0.13
ÏĦεÏģ    -0.13
ırı      -0.13
Positive Logits
cond     0.15
esa      0.15
cal      0.14
Flash    0.14
ocket    0.14
jez      0.14
UF       0.14
ucer     0.14
IME      0.14
heim     0.13
Activations Density: 0.083%