INDEX
Explanations
mentions of former political leaders and officials
New Auto-Interp
Negative Logits
uyo
-0.16
enting
-0.15
ÃŃr
-0.15
ascar
-0.15
endum
-0.14
ÃŃda
-0.14
VG
-0.14
ipt
-0.14
Ñĸнг
-0.14
ilk
-0.14
POSITIVE LOGITS
former
0.19
býval
0.17
/pub
0.17
Former
0.16
helicopt
0.16
MD
0.15
Former
0.15
alumni
0.15
former
0.15
OOT
0.15
Activations Density 0.087%