INDEX
Explanations
The neuron is looking for mentions of political parties and related activities
mentions of political parties
New Auto-Interp
Negative Logits
ravings
-0.80
alam
-0.77
Grizz
-0.75
aneously
-0.73
Hoo
-0.70
apons
-0.70
obar
-0.66
Wheat
-0.66
alez
-0.65
Dull
-0.65
POSITIVE LOGITS
affiliation
1.29
affili
1.05
leadership
1.05
primaries
0.98
nominate
0.97
nominating
0.97
caucus
0.96
leader
0.95
nominee
0.95
caucuses
0.94
Activations Density 0.052%