INDEX
Explanations
sections of data separated by dividers or formatting lines
numerical entities
New Auto-Interp
Negative Logits
autorytatywna
-1.21
kasarigan
-1.16
незавершена
-1.13
queſto
-1.05
[@BOS@]
-1.05
<pad>
-1.05
<unused16>
-1.04
<unused17>
-1.04
<unused14>
-1.04
<unused3>
-1.04
POSITIVE LOGITS
-
0.51
1
0.48
2
0.45
,
0.44
↵
0.43
0
0.43
:
0.42
/
0.42
<b>
0.41
0.41
Activations Density 0.246%