Explanations
calls to action or prompts for engagement
Negative Logits
/from      -0.15
ýt         -0.15
yo         -0.15
лим        -0.14
_:         -0.13
"crypto    -0.13
viously    -0.13
ymous      -0.13
Scho       -0.13
antic      -0.13
Positive Logits
now        0.27
Now        0.25
_now       0.23
now        0.21
الآن       0.20
Now        0.20
-now       0.20
your       0.19
agora      0.19
сейчас     0.18
Activations Density 0.125%