Explanations
references to public opinion and voting behavior
phrases and words related to public opinion and societal issues
Negative Logits
Token        Logit
aliases      -0.68
ortment      -0.66
promotion    -0.65
lic          -0.63
delet        -0.63
Xuan         -0.63
Equip        -0.62
ourney       -0.62
tein         -0.62
iba          -0.61
Positive Logits
Token             Logit
overwhelmingly    1.20
clam              1.12
vote              1.05
distrust          1.01
revolt            0.98
perceive          0.94
Vote              0.93
mistrust          0.93
electing          0.92
forgive           0.89
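
Lists like the two above are commonly produced by projecting a feature's output direction through the model's unembedding matrix and ranking vocabulary tokens by the resulting score. Whether this dashboard computes them exactly that way is an assumption; the sketch below only illustrates the general idea in plain Python. The function names, the toy vocabulary, and every number in it are hypothetical and are not taken from this feature.

```python
# Toy sketch: rank vocabulary tokens by a feature's effect on output logits.
# Everything here (names, vocabulary, values) is hypothetical illustration,
# not data from the dashboard.

def logit_effects(feature_direction, unembedding):
    """Project a feature direction (length d) through an unembedding
    matrix with d rows and one column per vocabulary token."""
    d = len(feature_direction)
    vocab_size = len(unembedding[0])
    return [
        sum(feature_direction[i] * unembedding[i][j] for i in range(d))
        for j in range(vocab_size)
    ]

def top_and_bottom(effects, vocab, k):
    """Return the k most positive and k most negative (token, score) pairs."""
    ranked = sorted(zip(vocab, effects), key=lambda pair: pair[1])
    negative = ranked[:k]        # most negative score first
    positive = ranked[::-1][:k]  # most positive score first
    return positive, negative

vocab = ["alpha", "beta", "gamma", "delta"]   # placeholder tokens
direction = [1.0, -0.5]                       # assumed feature direction, d = 2
unembedding = [                               # assumed d x vocab matrix
    [1.0, 0.75, -0.5, -0.25],
    [-0.5, -0.5, 0.5, 0.75],
]

effects = logit_effects(direction, unembedding)
positive, negative = top_and_bottom(effects, vocab, k=2)

for token, score in positive:
    print(f"{token:<10}{score:+.2f}")
for token, score in negative:
    print(f"{token:<10}{score:+.2f}")
```

With a real model the direction and unembedding matrix would come from the trained weights, and k would be 10 to match the lists shown here.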
Activations Density 0.458%
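
An activation density figure like this is usually read as the fraction of sampled tokens on which the feature fires at all. That reading is an assumption here, since the page does not define the term. A minimal sketch under that assumption, with a hypothetical helper name and made-up activations:

```python
# Toy sketch: activation density as the share of tokens with a nonzero
# (above-threshold) activation. The helper name and the sample values are
# hypothetical, not taken from the dashboard.

def activation_density(activations, threshold=0.0):
    """Fraction of activations strictly greater than `threshold`."""
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)

# 1000 sampled tokens, of which 5 activate the feature.
sample = [0.0] * 995 + [0.3, 1.2, 0.7, 2.1, 0.05]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.500%
```

Under this definition, a density of 0.458% would mean the feature is active on roughly 46 of every 10,000 tokens sampled.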