INDEX
Explanations
punctuation marks or symbols, especially those often associated with formatting
Punctuation before specific words
New Auto-Interp
Negative Logits
myſelf
-0.96
كومونز
-0.96
PWN
-0.91
laun
-0.89
Efq
-0.87
)+"
-0.83
ſeveral
-0.83
Majefty
-0.81
ſame
-0.80
Jamestown
-0.80
POSITIVE LOGITS
いる
1.00
iwa
0.84
Humphries
0.83
sertation
0.81
出版年
0.80
Rosenthal
0.80
Ayres
0.75
sons
0.73
eniu
0.73
e
0.71
Activations Density 0.253%