INDEX
Explanations
references to Russia and its related entities
New Auto-Interp
Negative Logits
rhestr
-0.54
Welsh
-0.45
Welsh
-0.45
surla
-0.43
withIdentifier
-0.40
fos
-0.38
delwed
-0.38
Transverse
-0.38
Chwiliwch
-0.37
wales
-0.37
POSITIVE LOGITS
Russia
0.98
Russian
0.96
Russians
0.87
Moscow
0.86
Russia
0.81
Moscú
0.80
Russian
0.79
Rusia
0.78
Russie
0.77
Russland
0.77
Activations Density 0.143%