INDEX
Explanations
references to Russia and its leaders
Negative Logits
Token            Logit
MLLoader         -0.61
ագրություններ    -0.61
AccessException  -0.56
étit             -0.56
autorytatywna    -0.54
uch              -0.53
openzeppelin     -0.52
metria           -0.52
mentaux          -0.51
wap              -0.51
Positive Logits
Token     Logit
Russia    1.22
Russian   1.12
Soviet    1.12
Russians  1.06
Russia    1.04
Russie    1.03
USSR      0.96
Soviet    0.92
RUSS      0.92
Russian   0.90
Activations Density: 0.174%