INDEX
Explanations
phrases related to confrontation or conflict
strong opinions or reactions about superhero movies and related cultural commentary
New Auto-Interp
Negative Logits
hari
-0.70
(>
-0.69
UNCLASSIFIED
-0.67
'[
-0.66
ItemImage
-0.66
Palest
-0.66
ascript
-0.64
[+
-0.64
XY
-0.63
Therefore
-0.63
POSITIVE LOGITS
surprises
0.80
eyebrows
0.68
PHOTOS
0.68
downright
0.63
headlines
0.62
alarms
0.61
Clown
0.61
ueller
0.61
selfie
0.61
laughs
0.61
Activations Density 1.204%