INDEX
Explanations
words related to violence
references to violence or violent acts
New Auto-Interp
Negative Logits
Premium
-0.81
Pod
-0.77
DK
-0.74
rin
-0.72
Boost
-0.72
TTL
-0.71
Sparkle
-0.71
elle
-0.71
Fleet
-0.69
Labs
-0.69
POSITIVE LOGITS
violent
3.37
violent
2.65
Violent
2.33
violence
2.08
nonviolent
2.03
violence
1.91
violently
1.79
Viol
1.76
murderous
1.71
Violence
1.65
Activations Density 0.020%