Explanations
repeated mentions of the word "the"
Negative Logits
Token              Logit
Paglinawan         -1.49
itſelf             -1.21
myſelf             -1.21
doubtnut           -1.19
kaarangay          -1.18
Geplaatst          -1.18
resourceCulture    -1.17
Autoritní          -1.12
Савезне            -1.10
Anſ                -1.09
Positive Logits
Token              Logit
,                  1.04
.                  0.95
(blank)            0.93
<eos>              0.86
?                  0.85
(blank)            0.81
...                0.80
:                  0.80
;                  0.79
(                  0.78
Activations Density: 0.473%