INDEX
Explanations
words related to violence and killing
instances of the word "kill" and its variations
New Auto-Interp
Negative Logits
ational
-0.79
BuyableInstoreAndOnline
-0.77
herty
-0.76
Lich
-0.65
kj
-0.63
agall
-0.62
Celest
-0.62
Sparkle
-0.61
âĵĺ
-0.59
bles
-0.59
POSITIVE LOGITS
spree
1.00
switch
0.98
mails
0.94
joy
0.90
houses
0.89
killer
0.89
blow
0.86
uminati
0.86
bird
0.84
instinct
0.83
Activations Density 0.089%