INDEX
Explanations
words ending in "ist" and some related words.
New Auto-Interp
Negative Logits
raiſ
-1.23
itſelf
-1.16
faſt
-1.13
Majefty
-1.10
Efq
-1.09
Reſ
-1.06
doubtnut
-1.05
ſta
-1.05
Anſ
-1.05
myſelf
-1.04
POSITIVE LOGITS
I
0.60
.
0.60
сове
0.55
We
0.53
0.52
pember
0.52
And
0.52
répondu
0.51
usitis
0.51
udf
0.51
Activations Density 1.060%