INDEX
Explanations
phrases related to hands-on experiences or activities
Negative Logits
ingen      -0.17
alie       -0.16
uler       -0.16
inger      -0.15
ší         -0.15
oth        -0.15
htable     -0.14
extreme    -0.14
alb        -0.14
_ROUT      -0.14
Positive Logits
-on        0.38
-On        0.31
Hands      0.25
hands      0.24
dirty      0.21
Hands      0.21
-ons       0.21
_On        0.20
-down      0.20
-free      0.20
Activations Density: 0.006%