INDEX
Explanations
mentions of the country "Russia" or things related to Russia
references to Russia or Russian identity
New Auto-Interp
Negative Logits
-->
-0.73
:=
-0.73
arde
-0.70
CHR
-0.67
addle
-0.67
fmt
-0.66
Giles
-0.66
DH
-0.65
intrins
-0.63
[&
-0.63
POSITIVE LOGITS
Russian
3.58
Russian
3.19
Russians
2.96
Kremlin
2.60
Russia
2.59
Russia
2.55
Ukrainian
2.50
Moscow
2.40
Moscow
2.32
Putin
2.19
Activations Density 0.015%