INDEX
Explanations
actions involving physical movements, particularly ones related to arms and hands
actions involving physical contact or impact
New Auto-Interp
Negative Logits
CLUS
-0.87
suites
-0.77
conformity
-0.76
undown
-0.68
Bast
-0.68
redundancy
-0.67
eming
-0.66
pandemonium
-0.65
fam
-0.65
士
-0.65
POSITIVE LOGITS
gently
1.07
fingers
1.01
fingertips
0.99
angled
0.97
tip
0.95
paddle
0.92
button
0.91
downwards
0.91
tips
0.90
pressed
0.90
Activations Density 0.351%