INDEX
Explanations
mentions of politicians and political events
New Auto-Interp
Negative Logits
viation
-0.76
eric
-0.68
ruary
-0.66
lag
-0.63
ISBN
-0.63
ibility
-0.63
estones
-0.62
ulia
-0.61
tion
-0.60
acl
-0.59
POSITIVE LOGITS
Kavanaugh
0.85
Senate
0.73
Dems
0.66
Ħ¢
0.65
Ford
0.62
%]
0.62
avanaugh
0.62
accuser
0.61
').
0.61
asse
0.60
Activations Density 0.518%