INDEX
Explanations
references to a specific geographical location, Moscow
references to the city of Moscow
New Auto-Interp
Negative Logits
âĢ¢âĢ¢âĢ¢âĢ¢
-0.82
Thom
-0.79
ICAN
-0.79
âķIJâķIJ
-0.77
HM
-0.74
AAAA
-0.73
pir
-0.73
ERA
-0.72
IRD
-0.71
SHARE
-0.71
POSITIVE LOGITS
rall
1.12
Moscow
1.05
Lumpur
0.99
Kremlin
0.96
ascus
0.87
achev
0.81
kaya
0.78
Moscow
0.77
mosqu
0.76
akov
0.75
Activations Density 0.005%