INDEX
Explanations
the name "Vladimir Putin"
mentions of Vladimir Putin
New Auto-Interp
Negative Logits
dress
-0.84
cular
-0.74
roads
-0.74
cule
-0.73
kick
-0.72
scenes
-0.70
backer
-0.70
alach
-0.69
growth
-0.67
cut
-0.67
POSITIVE LOGITS
Putin
1.28
Vlad
1.05
Vladimir
1.02
Nab
0.98
Ily
0.98
Lenin
0.97
Jinping
0.95
Dmitry
0.94
Tayyip
0.89
Lavrov
0.89
Activations Density 0.005%