INDEX
Explanations
references to specific individuals or entities, particularly those with the prefix "De."
New Auto-Interp
Negative Logits
itſelf
-0.94
Efq
-0.92
leaſt
-0.82
Chriſt
-0.80
Houſe
-0.80
faſt
-0.77
Majefty
-0.77
raiſ
-0.75
―――――
-0.74
poffible
-0.74
POSITIVE LOGITS
De
2.90
De
2.65
Де
1.21
DeV
1.12
DeL
1.01
Де
0.98
DeWitt
0.91
Di
0.90
Del
0.89
Dever
0.88
Activations Density 0.081%