INDEX
Explanations
phrases related to violent acts or crimes
references to violent acts, particularly killings and murders
New Auto-Interp
Negative Logits
Cola
-0.87
Stud
-0.72
amina
-0.70
aque
-0.70
Radio
-0.67
Tire
-0.66
Student
-0.65
Plex
-0.63
ais
-0.63
Jer
-0.62
POSITIVE LOGITS
killings
1.14
murders
1.04
shootings
0.99
spree
0.98
homicides
0.98
poons
0.95
ongs
0.89
slaying
0.88
slay
0.82
shooting
0.81
Activations Density 0.015%