INDEX
Explanations
special characters or non-standard symbols in the text
New Auto-Interp
Negative Logits
...
-0.17
â̝
-0.17
-0.16
...↵↵
-0.15
--
-0.15
ÂŃ
-0.15
-
-0.14
--
-0.14
...
-0.13
âĢī
-0.13
POSITIVE LOGITS
:-↵
0.23
;'↵
0.18
;-
0.17
Freem
0.17
:;↵
0.16
hint
0.16
:]↵
0.16
;↵
0.15
?;↵
0.15
pointers
0.15
Activations Density 0.003%