INDEX
Explanations
phrases indicating progression or increasing intensity in various contexts
New Auto-Interp
Negative Logits
ongo
-0.15
ordin
-0.15
pit
-0.15
816
-0.15
ä¿Ĥ
-0.14
elt
-0.14
Richards
-0.14
ehir
-0.13
_initializer
-0.13
йн
-0.13
POSITIVE LOGITS
increasingly
0.18
ubo
0.16
cci
0.15
è¶Ĭ
0.15
ä¸ģ
0.15
_until
0.14
é«
0.14
Bd
0.14
UBL
0.14
pai
0.14
Activations Density 0.193%