INDEX
Explanations
violent or aggressive behavior, conflict, and negative interactions
New Auto-Interp
Negative Logits
ITNESS
-0.72
mberg
-0.67
ourced
-0.64
avez
-0.64
inoa
-0.64
ucket
-0.61
ovember
-0.61
ĸļ
-0.61
agine
-0.60
PsyNetMessage
-0.60
POSITIVE LOGITS
ly
1.06
ness
0.74
nesses
0.73
icious
0.70
circle
0.67
vicious
0.64
assault
0.60
havoc
0.59
retribution
0.59
bite
0.59
Activations Density 10.914%