Explanations
specific punctuation marks and sentence structures that indicate emotional or dramatic emphasis
Negative Logits
Leaks        -0.18
orman        -0.16
→↵↵          -0.16
Liberation   -0.15
anton        -0.15
rien         -0.15
эй           -0.15
leys         -0.14
ź            -0.14
OKIE         -0.14
Positive Logits
astro        0.15
ipur         0.14
ович         0.14
ativo        0.13
μφ           0.13
aret         0.13
ravel        0.13
             0.13
igon         0.13
постанов     0.13
Activations Density 0.508%