INDEX
Explanations
activities related to exerting effort or pressure
variations of the word "push."
Negative Logits
Token          Logit
abol           -0.72
Interstitial   -0.70
Recogn         -0.70
omial          -0.68
Receiver       -0.66
uster          -0.65
abad           -0.64
enfranch       -0.59
Seym           -0.59
Chaff          -0.58
Positive Logits
Token          Logit
chairs         0.91
pushing        0.85
push           0.85
pushed         0.82
back           0.82
push           0.78
harder         0.77
boxes          0.77
toward         0.77
forward        0.76
Activations Density: 0.031%