INDEX
Explanations
punctuation marks, particularly periods and ellipses
New Auto-Interp
Negative Logits
“
-1.50
-1.26
,
-1.26
"
-1.23
(
-1.18
/
-1.14
-1.10
or
-1.03
.
-1.02
:
-1.01
POSITIVE LOGITS
Efq
2.71
).'
2.65
myſelf
2.48
?'
2.42
itſelf
2.38
Monfieur
2.34
)':
2.30
.’”
2.29
!'
2.28
)'
2.27
Activations Density 0.230%