INDEX
Explanations
punctuation marks and formatting indicators in the text
New Auto-Interp
Negative Logits
(
-0.17
nbsp
-0.15
...]
-0.15
oldown
-0.15
$MESS
-0.14
ãĥªãĥ³ãĤ°
-0.14
=-=-=-=-=-=-=-=-
-0.14
%s
-0.14
...(
-0.14
taboola
-0.14
POSITIVE LOGITS
‘
0.19
...)↵
0.17
or
0.16
aka
0.16
±
0.16
--)
0.15
http
0.15
^^
0.15
__)
0.15
,)
0.14
Activations Density 0.437%