Negative Logits
Efq         -1.34
Monfieur    -1.30
greateſt    -1.28
ſeveral     -1.24
myſelf      -1.23
Anſ         -1.22
pleaſure    -1.22
reaſon      -1.20
Reſ         -1.19
purpoſe     -1.19
Positive Logits
[blank]     0.74
,           0.73
s           0.71
is          0.66
in          0.65
of          0.64
.           0.64
(           0.63
ic          0.61
↵↵          0.60
Activations Density 0.445%