Explanations
terms related to the country Russia and its influence or actions
Negative Logits
¬       -0.18
ifr     -0.16
izon    -0.16
atos    -0.15
ž       -0.15
eker    -0.15
iswa    -0.14
asley   -0.14
ress    -0.14
ahu     -0.14
Positive Logits
Federation   0.18
Russia       0.17
ãĥ¼ãĥª       0.17
fed          0.17
Dmit         0.16
Fed          0.16
Фед          0.16
Russian      0.15
ìĭľìķĦ       0.15
Roulette     0.15
Activations Density 0.028%
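
Two of the positive-logit tokens (ãĥ¼ãĥª and ìĭľìķĦ) look like the printable stand-ins that byte-level BPE vocabularies use for raw UTF-8 bytes. The sketch below assumes the tokenizer follows the GPT-2 byte-to-character convention, which this listing does not confirm; the names `bytes_to_unicode` and `decode_token` are illustrative, not part of any dashboard API. Under that assumption, the tokens can be mapped back to readable text as follows:

```python
def bytes_to_unicode():
    """Build the GPT-2 style byte -> printable character table (assumed convention)."""
    # Bytes that are already printable map to themselves.
    bs = list(range(33, 127)) + list(range(161, 173)) + list(range(174, 256))
    cs = bs[:]
    n = 0
    # Every remaining byte is assigned a code point starting at 256.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, map(chr, cs)))


# Inverse table: printable character -> original byte value.
CHAR_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Map a byte-level token string back to text.

    Tokens containing characters outside the table (for example, text the
    dashboard has already decoded, such as Cyrillic) are returned unchanged.
    """
    if any(ch not in CHAR_TO_BYTE for ch in token):
        return token
    raw = bytes(CHAR_TO_BYTE[ch] for ch in token)
    # Partial UTF-8 sequences decode to the replacement character.
    return raw.decode("utf-8", errors="replace")


print(decode_token("ãĥ¼ãĥª"))   # ーリ
print(decode_token("ìĭľìķĦ"))   # 시아
print(decode_token("Фед"))      # Фед (already decoded, returned as-is)
```

If the assumption holds, the second token is a Korean fragment that completes 러시아 ("Russia"), which fits the feature's explanation. The first is a Japanese katakana fragment whose source word cannot be determined from the token alone.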