INDEX
Explanations
words related to movement and transitions
Negative Logits
ark      -0.16
nder     -0.15
uilt     -0.15
ilder    -0.14
bas      -0.14
atas     -0.14
alo      -0.14
kem      -0.14
alb      -0.14
propos   -0.14
Positive Logits
into     0.32
onto     0.30
into     0.26
onto     0.22
Into     0.21
back     0.21
naar     0.20
vào      0.19
ไปย      0.19
Into     0.19
Activations Density 0.167%