INDEX
Explanations
phrases depicting violent actions
instances of violent actions or attacks
New Auto-Interp
Negative Logits
DragonMagazine
-0.92
imester
-0.81
inventoryQuantity
-0.76
iterranean
-0.72
iHUD
-0.70
Factor
-0.69
Conversation
-0.68
arenthood
-0.67
glers
-0.66
YC
-0.66
POSITIVE LOGITS
senseless
1.23
unconscious
1.10
merciless
1.04
harder
1.02
violently
1.02
repeatedly
1.01
inappropriately
0.98
furiously
0.95
hard
0.93
hardest
0.91
Activations Density 0.154%