INDEX
Explanations
mentions of political affiliations, specifically focusing on Democrats and Republicans
Negative Logits
obyl      -0.72
icles     -0.69
iddles    -0.68
weather   -0.65
oken      -0.64
obar      -0.63
urious    -0.62
iffs      -0.62
ouston    -0.61
ysics     -0.61
Positive Logits
Party         0.64
maid          0.59
standby       0.55
stronghold    0.54
operative     0.53
naissance     0.53
counterpart   0.53
hood          0.53
士            0.52
geist         0.52
Activations Density: 7.934%