INDEX
Explanations
notations or formatting elements within the text
language codes or user prefixes
New Auto-Interp
Negative Logits
ſicht
-1.00
queſta
-1.00
ſch
-0.98
<unused16>
-0.96
<unused8>
-0.96
[@BOS@]
-0.96
<unused41>
-0.96
<unused43>
-0.96
<unused74>
-0.96
<pad>
-0.96
POSITIVE LOGITS
↵
0.56
1
0.43
_
0.42
0.42
2
0.41
0.40
.
0.40
S
0.38
0.37
*
0.36
Activations Density 0.000%