INDEX
Explanations
phrases related to political campaigning and electoral processes
Head Attr Weights
0:0.01
1:0.01
2:0.05
3:0.06
4:0.13
5:0.02
6:0.03
7:0.36
8:0.03
9:0.03
10:0.12
11:0.09
Negative Logits
lethal    -1.55
Redditor  -1.44
�         -1.37
alore     -1.36
Cod       -1.34
aurus     -1.32
trigger   -1.32
Output    -1.31
trop      -1.31
Crunch    -1.31
Positive Logits
advoc         1.63
"],"          1.48
purity        1.47
behalf        1.46
patriotism    1.45
eting         1.44
lobb          1.42
endorsements  1.42
superiority   1.41
passionately  1.41
Activations Density 0.004%