INDEX
Explanations
references to political figures and actions
phrases related to political declarations and criticisms of leadership
New Auto-Interp
Negative Logits
theless
-0.67
endum
-0.66
pmwiki
-0.65
subsequ
-0.64
Footnote
-0.62
afore
-0.60
effected
-0.59
appell
-0.59
®,
-0.58
techn
-0.58
POSITIVE LOGITS
Kavanaugh
0.82
æŃ¦
0.76
MORE
0.75
}}
0.69
Dems
0.63
]}
0.63
Virginia
0.61
æ©
0.61
celeb
0.60
ļéĨĴ
0.60
Activations Density 0.056%