INDEX
Negative Logits
end
-1.88
rest
-1.15
art
-1.15
-1.15
des
-1.08
est
-1.05
de
-1.03
(
-1.02
in
-0.98
E
-0.97
POSITIVE LOGITS
Efq
2.48
myſelf
2.39
itſelf
2.34
Monfieur
2.33
houſe
2.23
Houſe
2.23
Theſe
2.22
purpoſe
2.20
pleaſure
2.16
ſeveral
2.13
Activations Density 0.149%