INDEX
Explanations
references to historical publications and their contexts
New Auto-Interp
Negative Logits
propOrder
-0.88
Majefty
-0.88
ſelf
-0.83
ſelves
-0.80
betweenstory
-0.79
pleaſure
-0.79
itſelf
-0.78
expandindo
-0.77
faſt
-0.75
felves
-0.75
POSITIVE LOGITS
to
0.33
뽑
0.31
<eos>
0.29
,
0.29
stik
0.29
[…]
0.28
[
0.28
ar
0.28
nameof
0.27
0.27
Activations Density 0.059%