INDEX
Explanations
words related to physical damage, wounds, or graphic descriptions of bodily harm
themes related to violence and its consequences
Negative Logits
Token         Logit
izont         -0.66
otiation      -0.64
elfth         -0.62
hander        -0.61
imeo          -0.59
assad         -0.59
ollower       -0.59
Participant   -0.58
akening       -0.58
rolet         -0.57
Positive Logits
Token         Logit
goodies       0.94
fumes         0.84
tentacles     0.84
junk          0.84
feces         0.81
crap          0.80
smells        0.79
acron         0.79
needles       0.79
horrors       0.78
Activations Density 0.442%