INDEX
Explanations
phrases related to the concept of slowness or gradualness
New Auto-Interp
Negative Logits
ulet
-0.16
-ÑĤо
-0.16
;;;;;;;;
-0.14
ignon
-0.14
xbb
-0.14
antar
-0.14
reffen
-0.14
apologies
-0.14
butt
-0.14
пÑĢа
-0.14
POSITIVE LOGITS
æħ¢
0.21
Slow
0.20
slow
0.20
Slow
0.20
_slow
0.19
slow
0.17
slower
0.17
UFFIX
0.16
/fast
0.16
liest
0.15
Activations Density 0.029%