INDEX
Explanations
phrases related to physical actions or attributes involving hands
phrases related to open-mindedness and thinking flexibility
New Auto-Interp
Negative Logits
Reloaded
-0.58
iability
-0.58
ãĥ´ãĤ¡
-0.57
stall
-0.57
Brav
-0.57
soDeliveryDate
-0.55
ront
-0.55
MpServer
-0.54
segregation
-0.53
ripp
-0.53
POSITIVE LOGITS
hands
2.02
hand
2.00
eyes
1.86
fingers
1.83
eye
1.80
fingertips
1.69
lips
1.69
paws
1.65
mouth
1.64
finger
1.61
Activations Density 0.481%