Explanations
references to ethnic and national identity, particularly regarding Bulgarians and their history
Negative Logits
axed        -0.17
jection     -0.15
repid       -0.15
orate       -0.14
Ñijм        -0.14
obe         -0.14
Schwartz    -0.14
alace       -0.14
chim        -0.14
dez         -0.14
Positive Logits
uru         0.15
/english    0.14
ogne        0.14
RESET       0.13
лÑĮ         0.13
ONGL        0.13
pied        0.13
masses      0.13
ische       0.13
Roger       0.13
Activations Density 0.057%