INDEX
Explanations
words and phrases related to personal involvement or possession
New Auto-Interp
Negative Logits
velt
-0.17
ika
-0.16
oni
-0.16
cmdline
-0.16
busy
-0.15
@n
-0.15
živ
-0.15
oir
-0.15
tout
-0.14
cape
-0.14
POSITIVE LOGITS
eko
0.15
acie
0.15
enci
0.14
Bod
0.14
Pear
0.14
лон
0.14
Duffy
0.14
Bund
0.14
bund
0.13
eton
0.13
Activations Density 0.004%