INDEX
Explanations
phrases that indicate an active process of progression or change
New Auto-Interp
Negative Logits
ucks
-0.19
asket
-0.15
inyin
-0.15
antt
-0.14
assen
-0.14
downt
-0.14
aurus
-0.13
Pres
-0.13
*</
-0.13
ottie
-0.13
POSITIVE LOGITS
illas
0.17
chal
0.16
_mC
0.16
Malk
0.14
å§ĭ
0.14
¾
0.14
Tubes
0.14
STEP
0.14
åľ
0.14
_mD
0.13
Activations Density 0.036%