INDEX
Explanations
statements and claims regarding advocacy and support for social issues
New Auto-Interp
Negative Logits
Banana
-0.81
RIP
-0.73
Canal
-0.71
RIP
-0.69
Heavenly
-0.68
Reincarn
-0.68
Ruler
-0.67
Sakuya
-0.67
Delicious
-0.65
Bye
-0.65
POSITIVE LOGITS
accuse
1.06
rallied
1.05
argue
1.02
lobbied
0.98
allege
0.98
coales
0.97
rallying
0.97
mobilized
0.97
urged
0.96
petition
0.94
Activations Density 0.220%