INDEX
Explanations
punctuation and formatting elements within the text
Negative Logits
disambiguazione    -0.94
الحياه             -0.84
itſelf             -0.77
NUMX               -0.76
iſt                -0.75
ſind               -0.73
Reſ                -0.73
bağlantılar        -0.72
الاطلاع            -0.72
ſche               -0.71
Positive Logits
.            0.58
<eos>        0.54
__':         0.50
</em>        0.49
com          0.47
</strong>    0.46
co           0.45
</code>      0.44
↵↵           0.44
coration     0.44
Activations Density 0.111%