INDEX
Explanations
Japanese expressions and musical notations
repeated words and punctuation
New Auto-Interp
Negative Logits
المعيارى
-1.16
<unused3>
-1.04
<unused16>
-1.04
<unused42>
-1.04
<unused43>
-1.04
[@BOS@]
-1.04
<unused8>
-1.04
<unused41>
-1.04
<unused51>
-1.04
<unused28>
-1.04
POSITIVE LOGITS
↵↵
0.35
0.31
↵
0.30
<eos>
0.27
.
0.27
↵↵↵
0.25
Collections
0.25
.,
0.25
。
0.25
!
0.23
Activations Density 0.033%