INDEX
Explanations
mentions of people taking actions in response to events or expressing opinions on social platforms
references to actions taken by individuals or groups, particularly in the context of social or political activity
New Auto-Interp
Negative Logits
INGS
-0.71
CLASSIFIED
-0.70
afia
-0.68
ements
-0.64
ablishment
-0.64
ascript
-0.62
/-
-0.61
glomer
-0.61
ativity
-0.58
don
-0.58
POSITIVE LOGITS
0.91
0.86
pless
0.82
0.76
amph
0.74
social
0.72
0.71
forums
0.71
umblr
0.70
stride
0.69
Activations Density 0.052%