INDEX
Explanations
discourse markers indicating that someone is stepping through a mathematical argument, often checking work
New Auto-Interp
Negative Logits
è¿Ļæĺ¯
-0.07
-that
-0.07
thats
-0.07
roi
-0.06
itoris
-0.06
yes
-0.06
sounds
-0.06
nothing
-0.06
ìĿ´ëĬĶ
-0.06
hå
-0.06
POSITIVE LOGITS
now
0.13
Now
0.13
Now
0.12
maintenant
0.10
_now
0.10
çİ°åľ¨
0.09
ahora
0.09
now
0.09
teÄı
0.09
now
0.09
Activations Density 0.151%