INDEX
NEGATIVE LOGITS
Token   Logit
r       -0.08
-       -0.08
“       -0.08
you     -0.08
"       -0.08
I       -0.08
#       -0.08
sort    -0.08
\n      -0.08
="      -0.07
POSITIVE LOGITS
Token   Logit
the     0.17
The     0.13
The     0.12
THE     0.12
_the    0.11
the     0.11
The     0.10
.The    0.10
THE     0.10
>The    0.09
Activations Density 3.768%