Explanations
punctuation marks and symbols used in code or formatting
Negative Logits
Token   Logit
)       -0.45
-       -0.44
.       -0.43
/       -0.41
“       -0.41
’       -0.40
]       -0.39
:       -0.39
↵       -0.36
–       -0.36
Positive Logits
Token   Logit
"/",    1.77
"*",    1.73
'*',    1.70
"",     1.69
'/',    1.66
'',     1.62
[],     1.62
{},     1.61
"",     1.60
)))),   1.60
Activations Density 0.191%