INDEX
Explanations
references to the body part "shoulder"
references to the word "shoulder."
New Auto-Interp
Negative Logits
yrinth
-0.99
eer
-0.92
eers
-0.85
esis
-0.84
inction
-0.82
antis
-0.78
ktop
-0.78
icative
-0.77
nom
-0.73
leon
-0.73
POSITIVE LOGITS
blades
1.13
shoulder
0.95
bone
0.92
blade
0.87
shrug
0.87
straps
0.85
lobe
0.81
strap
0.79
bones
0.78
pless
0.78
Activations Density 0.030%