Explanations
aspects related to political figures and their actions or statements
Negative Logits
Token             Logit
ört               -0.15
ersh              -0.15
ammo              -0.15
ÑĨеп              -0.14
pcm               -0.14
ataire            -0.14
bor               -0.14
.updateDynamic    -0.13
ahun              -0.13
jk                -0.13
Positive Logits
Token             Logit
former            0.62
Former            0.55
Former            0.49
former            0.49
retired           0.37
býval             0.34
erst              0.32
formerly          0.29
ex                0.29
коли              0.27
Activations Density: 0.162%