INDEX
Explanations
phrases related to taking action or making progress
Negative Logits
¨        -0.16
ecta     -0.16
tid      -0.16
bou      -0.15
strt     -0.14
ipple    -0.14
ntag     -0.14
jes      -0.14
rtl      -0.14
enna     -0.14
Positive Logits
Steph     0.19
-step     0.19
step      0.19
step      0.18
.step     0.18
Step      0.18
stepped   0.17
FOOT      0.17
toes      0.16
stepping  0.16
Activations Density: 0.022%