INDEX
Explanations
references to political parties and candidates
New Auto-Interp
Negative Logits
alam
-0.74
Grizz
-0.73
Kard
-0.71
Dull
-0.70
ravings
-0.69
Hoo
-0.67
Wheat
-0.66
Maw
-0.65
DEN
-0.65
DERR
-0.64
POSITIVE LOGITS
affiliation
1.14
affili
1.00
leadership
0.93
goers
0.90
leader
0.87
Leader
0.85
membership
0.84
leaders
0.84
faithful
0.83
leader
0.82
Activations Density 0.381%