INDEX
Explanations
instances of the word "end" and its variations in various contexts
termination sequences
Negative Logits
queſta       -0.98
ſind         -0.94
$_(          -0.87
ſch          -0.86
ſche         -0.85
ſta          -0.84
ſtand        -0.82
transQ       -0.81
ſelf         -0.80
<unused14>   -0.79
Positive Logits
↵            0.33
mesma        0.28
ostante      0.26
lecteur      0.26
j            0.25
↵↵           0.24
poptotic     0.24
meu          0.24
Empereur     0.24
.            0.24
Activations Density 0.002%