INDEX
Explanations
mention of violent actions and incidents, specifically involving shooting and killing
phrases related to violent incidents and casualties
New Auto-Interp
Negative Logits
heit
-0.88
Administ
-0.83
Username
-0.79
ional
-0.75
SPONSORED
-0.72
soType
-0.72
VERTISEMENT
-0.66
TERN
-0.66
Factor
-0.65
Granted
-0.65
POSITIVE LOGITS
torped
0.86
espresso
0.85
sparks
0.79
lasers
0.76
bullets
0.71
bang
0.71
Shotgun
0.70
unarmed
0.70
missiles
0.70
cannon
0.69
Activations Density 0.148%