INDEX
Explanations
phrases related to violent actions
phrases related to violence and aggression
New Auto-Interp
Negative Logits
Tycoon
-0.78
Horus
-0.72
charms
-0.72
Europa
-0.70
Scotia
-0.70
modification
-0.69
Confederation
-0.68
oller
-0.68
Daniels
-0.68
Vaughn
-0.67
POSITIVE LOGITS
paren
1.16
bodied
1.10
up
1.03
edge
1.02
out
1.01
by
1.00
average
0.99
upon
0.98
looking
0.97
together
0.97
Activations Density 0.051%