INDEX
Explanations
references to specific body parts and injuries
New Auto-Interp
Negative Logits
orp
-0.18
claws
-0.15
atab
-0.15
odate
-0.15
seins
-0.15
Hands
-0.14
_heads
-0.14
bject
-0.14
oose
-0.14
antis
-0.14
POSITIVE LOGITS
leg
0.30
left
0.27
right
0.26
temple
0.25
arm
0.24
index
0.20
rib
0.20
shin
0.20
foot
0.20
fem
0.20
Activations Density 0.077%