Explanations
names of people and specific references to individuals
Negative Logits
Token      Logit
fat        -0.16
inya       -0.16
ležit      -0.15
apt        -0.15
urally     -0.14
klady      -0.14
obody      -0.14
reesome    -0.14
ãĤĩ        -0.14
licity     -0.14
Positive Logits
Token      Logit
(DE        0.17
adero      0.17
dent       0.15
oyer       0.15
uman       0.14
ãĥĨãĥ«     0.14
yll        0.14
alion      0.14
/Register  0.14
rement     0.14
Activations Density 0.092%