INDEX
Explanations
instances of "hands-on" activities or experiences
New Auto-Interp
Negative Logits
оÑĥ
-0.17
ailable
-0.16
Å¡ÃŃ
-0.15
.sponge
-0.15
htable
-0.14
cctor
-0.14
šak
-0.14
ilik
-0.14
alto
-0.14
adors
-0.14
POSITIVE LOGITS
-on
0.28
dirty
0.27
-down
0.24
Dirty
0.23
hands
0.23
dirty
0.23
free
0.22
down
0.22
-off
0.21
-free
0.21
Activations Density 0.007%