INDEX
Explanations
mentions of locations and events related to political activities
mentions of political events and entities
New Auto-Interp
Negative Logits
âĢ
-0.81
by
-0.79
withd
-0.73
â
-0.68
Ö¼
-0.66
By
-0.65
âĸĪ
-0.61
ãĢ
-0.60
âĢł
-0.60
âĻ
-0.60
POSITIVE LOGITS
imeo
0.63
rompt
0.57
complement
0.57
contrasted
0.57
ĪĴ
0.55
cknow
0.54
outweigh
0.54
onwards
0.52
playbook
0.52
entimes
0.52
Activations Density 0.903%