INDEX
Explanations
phrases that denote an effort to advance or challenge limits
New Auto-Interp
Negative Logits
سد
-0.16
uada
-0.16
throp
-0.15
ezier
-0.15
jac
-0.15
uvo
-0.15
uco
-0.15
ebek
-0.14
eker
-0.14
bsolute
-0.14
POSITIVE LOGITS
aside
0.34
-button
0.31
button
0.30
buttons
0.29
buttons
0.28
back
0.28
BUTTON
0.28
boundaries
0.27
forward
0.25
harder
0.25
Activations Density 0.036%