INDEX
Explanations
descriptions of violent events and their aftermath
New Auto-Interp
Negative Logits
èħ
-0.15
yro
-0.14
erture
-0.14
hab
-0.13
776
-0.13
tied
-0.13
coil
-0.13
ipt
-0.13
pockets
-0.13
Honor
-0.13
POSITIVE LOGITS
impact
0.30
Impact
0.27
Impact
0.25
impact
0.24
impacts
0.23
hitting
0.23
hit
0.22
HIT
0.20
collision
0.20
hard
0.20
Activations Density 0.113%