Explanations
phrases related to social and political controversies or uprisings
Negative Logits
Token        Logit
é¾įå         -0.62
ACTED        -0.58
matched      -0.53
chens        -0.52
chester      -0.51
afford       -0.50
Paddock      -0.49
undone       -0.48
flats        -0.48
OnePlus      -0.48
Positive Logits
Token        Logit
ire           0.75
disbelief     0.72
wrath         0.72
controversy   0.71
fury          0.70
curiosity     0.69
frenzy        0.69
creativity    0.68
indignation   0.67
laughter      0.67
Activations Density: 10.956%