INDEX
Explanations
beginning and headers of textual content
with non-standard or archaic spellings
old english spellings
New Auto-Interp
Negative Logits
q
-0.53
c
-0.46
y
-0.45
pem
-0.44
o
-0.44
h
-0.44
qu
-0.42
ph
-0.42
z
-0.41
vu
-0.41
POSITIVE LOGITS
myſelf
1.30
themſelves
1.25
Monfieur
1.23
itſelf
1.22
himſelf
1.21
purpoſe
1.11
becauſe
1.07
ſever
1.04
pleaſure
1.03
iſt
1.00
Activations Density 0.347%