INDEX
Explanations
words related to support or promotion of a cause or belief
discussions about advocacy or recommendations in a political context
New Auto-Interp
Negative Logits
exit
-0.80
processing
-0.78
identification
-0.78
interaction
-0.77
stunning
-0.76
background
-0.75
chance
-0.74
release
-0.74
alert
-0.72
timing
-0.70
POSITIVE LOGITS
advocated
2.22
preached
2.08
advocating
1.88
preach
1.70
prescribe
1.62
instituted
1.59
coined
1.38
debated
1.32
pioneered
1.31
preaching
1.30
Activations Density 0.076%