INDEX
Explanations
mathematical symbols and formatting elements within equations
math and code symbols
New Auto-Interp
Negative Logits
Olvid
-0.44
gustó
-0.29
、
-0.28
supuestamente
-0.28
anunci
-0.27
originalmente
-0.27
образом
-0.26
değiştir
-0.26
znám
-0.26
ника
-0.26
POSITIVE LOGITS
zwiſchen
0.97
ſans
0.94
ſſung
0.93
niſſe
0.92
ロウィン
0.92
ſicht
0.91
<unused28>
0.91
<unused51>
0.91
[@BOS@]
0.91
<pad>
0.91
Activations Density 0.113%