INDEX
Explanations
keywords related to news headlines or current events
references to specific election events and political figures
Negative Logits
appre      -0.74
assum      -0.70
nav        -0.69
devs       -0.68
corpus     -0.67
Reviewer   -0.66
eday       -0.65
blat       -0.64
metab      -0.63
Lear       -0.63
Positive Logits
isconsin        0.89
Republican      0.81
psons           0.73
congressional   0.72
ptive           0.71
hillary         0.71
izza            0.70
PAC             0.69
esson           0.69
aska            0.67
Activations Density: 0.077%