INDEX
Negative Logits
Token       Logit
pleaſure    -1.52
myſelf      -1.48
purpoſe     -1.48
Anſ         -1.47
Monfieur    -1.46
ſeveral     -1.40
reaſon      -1.38
ſtate       -1.37
itſelf      -1.36
Reſ         -1.34
Positive Logits
Token       Logit
or          0.87
for         0.84
and         0.84
dem         0.84
(           0.82
b           0.81
vol         0.81
in          0.80
an          0.80
a           0.80
Activations Density 0.071%