INDEX
Explanations
references to the Russian President, Vladimir Putin
mentions of Vladimir Putin
New Auto-Interp
Negative Logits
dress
-0.80
roads
-0.75
kick
-0.73
cule
-0.72
cular
-0.71
--------------------------------------------------------
-0.69
backer
-0.68
words
-0.68
eating
-0.66
eals
-0.66
POSITIVE LOGITS
Putin
1.26
Vlad
1.02
Nab
0.97
Ily
0.96
Dmitry
0.95
Vladimir
0.94
Jinping
0.94
Lenin
0.94
Zh
0.90
Tayyip
0.88
Activations Density 0.008%