INDEX
Explanations
words related to aggression
instances of the term "aggro" and its variations, indicating a focus on aggression or conflict
New Auto-Interp
Negative Logits
phrine
-0.72
obser
-0.71
Gemini
-0.68
sights
-0.68
nsic
-0.66
fulness
-0.65
vironment
-0.65
enactment
-0.63
isation
-0.60
ological
-0.59
POSITIVE LOGITS
regate
1.36
regation
1.07
rieved
1.06
aign
0.91
entric
0.87
alos
0.87
orno
0.86
staff
0.84
les
0.82
arette
0.80
Activations Density 0.042%