INDEX
Explanations
standard terms and acronyms, especially in technical or scientific contexts
capital letters at the beginning of words
New Auto-Interp
Negative Logits
myſelf
-1.24
Theſe
-1.23
itſelf
-1.23
Reſ
-1.17
Houſe
-1.16
Majefty
-1.12
ſelves
-1.10
Efq
-1.09
―――――
-1.06
ſelf
-1.04
POSITIVE LOGITS
M
1.02
H
1.01
D
0.98
L
0.97
G
0.96
F
0.95
S
0.91
R
0.90
W
0.90
T
0.89
Activations Density 15.900%