INDEX
Explanations
descriptions of intense emotional or physical conflicts
situations involving conflict or aggression
Negative Logits
20439                -0.87
osate                -0.73
Consortium           -0.69
izen                 -0.69
ographies            -0.68
administrations      -0.67
immortality          -0.66
izons                -0.65
inction              -0.65
natureconservancy    -0.64
Positive Logits
yelling              1.12
altercation          1.05
violently            0.97
agitated             0.95
shouting             0.95
disorderly           0.93
angrily              0.93
verbally             0.93
prompting            0.90
ensued               0.89
Activations Density 0.278%