INDEX
Explanations
terms related to political events or issues
terms related to political contexts and long-term effects
Negative Logits
idge        -0.72
illard      -0.71
icultural   -0.68
icut        -0.66
lys         -0.66
zona        -0.66
atible      -0.65
iled        -0.64
offensive   -0.64
pron        -0.64
Positive Logits
Isis        0.67
pei         0.65
akia        0.65
jri         0.65
Shak        0.64
Amon        0.64
Bever       0.64
sha         0.63
jan         0.63
bounty      0.63
Activations Density: 0.000%