INDEX
Negative Logits
Efq
-1.42
Diſ
-1.35
Monfieur
-1.35
Theſe
-1.34
itſelf
-1.31
Anſ
-1.30
Jefus
-1.28
Reſ
-1.27
Majefty
-1.27
ſelf
-1.24
POSITIVE LOGITS
of
0.81
in
0.68
a
0.66
the
0.65
can
0.60
(
0.57
la
0.57
I
0.57
He
0.57
he
0.56
Activations Density 1.597%