Explanations
phrases encouraging action or expression
phrases related to action and assertiveness in a political context
Negative Logits
Token      Logit
ļéĨĴ       -0.76
lav        -0.71
wic        -0.70
Weekly     -0.63
Intake     -0.63
riad       -0.61
Testing    -0.60
Luck       -0.60
imore      -0.60
eties      -0.60
Positive Logits
Token      Logit
declare    1.16
proclaim   1.15
admit      1.03
scream     1.00
shout      0.99
embrace    0.96
say        0.95
apologize  0.94
announce   0.94
unleash    0.94
Activations Density 0.130%