INDEX
Explanations
variations of the word "I" and references to interpersonal relationships
New Auto-Interp
Negative Logits
ogg
-0.16
realised
-0.15
SavaÅŁ
-0.15
oenix
-0.15
felt
-0.15
chaft
-0.14
-valu
-0.14
.tell
-0.14
becomes
-0.14
realise
-0.14
POSITIVE LOGITS
belong
0.17
ÑĸмÑĸ
0.16
344
0.16
belongs
0.15
лагод
0.14
indeed
0.14
areth
0.14
dot
0.14
æĺ¯åľ¨
0.14
957
0.14
Activations Density 0.265%