INDEX
Explanations
phrases related to violent actions or crimes
references to violent crimes and suspects involved in them
New Auto-Interp
Negative Logits
Enhanced
-0.66
ausp
-0.65
introductory
-0.64
podcast
-0.64
oult
-0.62
OLOG
-0.62
renaissance
-0.62
advisory
-0.61
quir
-0.60
IDE
-0.60
POSITIVE LOGITS
angering
0.80
injuring
0.80
utterstock
0.78
senseless
0.78
ribly
0.75
fleeing
0.71
injure
0.71
whom
0.70
uckles
0.70
resisting
0.69
Activations Density 0.953%