INDEX
Explanations
text related to expressing support for various causes or individuals
references to political support and alliances
Negative Logits
ipl          -0.75
pandemonium  -0.70
wow          -0.63
prest        -0.61
seams        -0.59
overlap      -0.58
abulary      -0.57
mirrors      -0.57
dust         -0.56
Affect       -0.55
Positive Logits
boycott       0.98
initiatives   0.95
efforts       0.91
legalization  0.90
candidacy     0.90
financially   0.89
uncond        0.89
underdog      0.87
against       0.85
advoc         0.84
Activations Density: 0.376%