INDEX
Explanations
mentions of specific body parts being injured
New Auto-Interp
Negative Logits
Authors
-0.69
invitations
-0.63
discounts
-0.61
Administ
-0.61
constraints
-0.60
lobbyists
-0.60
TPP
-0.59
TIME
-0.59
DragonMagazine
-0.56
tides
-0.55
POSITIVE LOGITS
stride
1.04
abdomen
1.04
front
1.01
retaliation
0.96
cheek
0.96
thigh
0.94
chest
0.90
forehead
0.90
arms
0.88
extrem
0.87
Activations Density 0.053%