Explanations
references to the Republican strategist Karl Rove
mentions of specific political figures and organizations
Negative Logits
ambers      -0.82
isters      -0.81
ococ        -0.78
ivism       -0.77
ocks        -0.76
utes        -0.75
uador       -0.74
atsu        -0.70
ancouver    -0.70
reprene     -0.70
Positive Logits
Rove        1.21
Dire        0.94
wolves      0.91
wolf        0.90
Ratt        0.83
cones       0.82
nces        0.80
wald        0.78
bil         0.76
tto         0.76
Activations Density: 0.015%