INDEX
Explanations
words related to violent or aggressive content in cartoons
references to violence and aggression in media
Negative Logits
ufact      -1.05
wered      -0.91
grim       -0.90
çīĪ        -0.83
Sov        -0.83
udget      -0.82
thora      -0.79
Sov        -0.77
culosis    -0.77
aqu        -0.77
Positive Logits
cues             1.46
unconsciously    1.37
subconscious     1.36
behavioral       1.35
behaviors        1.33
perceptual       1.33
psychologists    1.32
cognitive        1.31
motivational     1.31
dopamine         1.25
Activations Density: 0.678%