INDEX
Explanations
references to political leaders and officials
Negative Logits
agi      -0.15
.Guna    -0.15
âĸį      -0.15
emmel    -0.15
itos     -0.14
ALLE     -0.14
ÏģÎŃ     -0.14
untas    -0.14
ofile    -0.14
Ñĥк      -0.14
Positive Logits
spokesman      0.19
spokeswoman    0.19
spokesperson   0.17
said           0.15
on             0.15
sp             0.15
Wyn            0.15
on             0.15
Ľå»º           0.15
Pink           0.14
Activations Density 0.118%