INDEX
Explanations
mentions of political parties
New Auto-Interp
Negative Logits
Grizz
-0.81
Kard
-0.70
Wheat
-0.70
Dull
-0.70
Drake
-0.70
Deter
-0.70
Hoo
-0.69
alam
-0.67
angelo
-0.66
ravings
-0.65
POSITIVE LOGITS
affiliation
1.06
leader
1.01
leaders
0.97
leader
0.97
leadership
0.97
affili
0.95
Leader
0.93
leaders
0.92
members
0.92
arians
0.92
Activations Density 0.024%