INDEX
Explanations
phrases related to striving for improvement or breaking boundaries
New Auto-Interp
Negative Logits
kt
-0.17
xit
-0.16
arser
-0.16
è¡Ĺéģĵ
-0.15
amo
-0.15
/by
-0.14
bsolute
-0.14
kir
-0.14
etimes
-0.14
avig
-0.14
POSITIVE LOGITS
limits
0.39
envelope
0.38
boundaries
0.36
Limits
0.35
envelopes
0.33
buttons
0.31
limits
0.29
aside
0.28
Limits
0.28
envelop
0.28
Activations Density 0.037%