INDEX
Explanations
violent actions or descriptions, including tearing, ripping, and chopping
violent or destructive actions
New Auto-Interp
Negative Logits
hof
-0.67
rophe
-0.64
lain
-0.64
Lobby
-0.63
Colomb
-0.60
lied
-0.60
MG
-0.60
Support
-0.60
ITNESS
-0.60
abilities
-0.59
POSITIVE LOGITS
away
1.03
apart
1.02
bones
0.96
awed
0.93
awa
0.92
down
0.90
nails
0.87
adoes
0.85
throats
0.84
tires
0.82
Activations Density 0.162%