INDEX
Explanations
references to political party affiliations and sentiments
New Auto-Interp
Head Attr Weights
0:0.12
1:0.01
2:0.15
3:0.06
4:0.08
5:0.03
6:0.07
7:0.02
8:0.14
9:0.03
10:0.07
11:0.16
Negative Logits
ftime
-1.63
vote
-1.42
rehe
-1.39
aisle
-1.37
plurality
-1.35
throb
-1.34
orum
-1.34
bip
-1.29
clot
-1.27
Votes
-1.26
POSITIVE LOGITS
alike
1.84
iliated
1.63
Gamma
1.49
ossal
1.42
typh
1.37
undy
1.33
respectively
1.32
Typh
1.30
GV
1.30
esters
1.27
Activations Density 0.018%