INDEX
Negative Logits
Ste
-0.82
A
-0.79
F
-0.77
,
-0.76
-0.75
R
-0.75
(
-0.75
M
-0.74
and
-0.73
as
-0.73
POSITIVE LOGITS
myſelf
1.69
ſelf
1.53
Efq
1.47
Jefus
1.45
itſelf
1.45
Theſe
1.45
ſelves
1.40
―――――
1.37
་་
1.36
Majefty
1.35
Activations Density 0.551%