INDEX
Explanations
occurrences of the newline character and variable names in programming contexts
Negative Logits
d       -0.16
gard    -0.15
er      -0.15
dale    -0.15
gon     -0.15
Ling    -0.15
da      -0.14
ow      -0.14
ized    -0.14
del     -0.14
Positive Logits
\n      0.32
\t      0.29
{}\     0.20
\r      0.20
#\      0.19
\xe     0.16
odia    0.16
\uD     0.15
.bz     0.15
\u      0.15
Activations Density 0.011%
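The density figure can be read as the share of token positions on which the feature fires at all. The sketch below assumes that reading; the page itself does not define the metric. The function name `activation_density`, the zero threshold, and the sample data are all hypothetical and chosen only to reproduce a value of 0.011%.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of tokens on which the feature is active.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: the feature fires on 11 of 100,000 tokens.
acts = [0.0] * 99_989 + [1.5] * 11
density = activation_density(acts)
print(f"{density:.3%}")
```

Under this reading, a density this low means the feature is sparse: it is silent on nearly every token and active only on a narrow set of inputs.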