INDEX
Explanations
phrases related to taking action or activism
references to specific calls to action or initiatives
New Auto-Interp
Negative Logits
plateau
-0.73
lf
-0.65
cottage
-0.65
conce
-0.65
¾
-0.63
eclips
-0.63
ettel
-0.63
wn
-0.61
ld
-0.61
ome
-0.61
POSITIVE LOGITS
Action
3.90
Action
2.79
ACTION
2.63
action
2.41
Actions
2.10
action
1.98
ACTIONS
1.42
actions
1.38
ACTION
1.38
actions
1.23
Activations Density 0.013%