INDEX
Explanations
proper nouns and significant identifiers
New Auto-Interp
Negative Logits
itſelf
-1.46
myſelf
-1.32
themſelves
-1.26
Efq
-1.25
Majefty
-1.21
poffible
-1.21
iſt
-1.19
pleaſure
-1.16
ſeveral
-1.13
auffi
-1.10
POSITIVE LOGITS
Mc
0.92
La
0.89
Van
0.84
De
0.83
Von
0.80
Le
0.78
van
0.73
Vanden
0.72
Mac
0.70
Las
0.69
Activations Density 0.158%