INDEX
Explanations
phrases related to the highest and lowest values, especially in a mathematical or comparative context
New Auto-Interp
Negative Logits
an
-0.96
er
-0.92
p
-0.87
o
-0.86
u
-0.84
a
-0.83
k
-0.83
r
-0.82
q
-0.77
Man
-0.75
POSITIVE LOGITS
beſt
1.34
greateſt
1.29
]")]
1.28
myſelf
1.27
leaſt
1.25
firſt
1.23
་་
1.21
apest
1.19
Majefty
1.16
</tfoot>
1.15
Activations Density 0.052%