INDEX
Explanations
violent actions and incidents involving physical harm
phrases and words related to physical violence and aggression
Negative Logits
DragonMagazine       -0.83
glers                -0.80
inventoryQuantity    -0.78
values               -0.68
Factor               -0.67
href                 -0.66
imester              -0.65
toggle               -0.65
udes                 -0.64
elist                -0.63
Positive Logits
repeatedly           1.07
unconscious          1.00
twice                0.94
forehead             0.92
buttocks             0.89
violently            0.88
abdomen              0.88
senseless            0.88
throat               0.88
cheek                0.87
Activations Density 0.174%