INDEX
Explanations
structural elements of written text or programming code
New Auto-Interp
Negative Logits
[…]
-1.28
[…]
-1.01
…
-1.00
</em>
-0.83
[...]
-0.83
</strong>
-0.80
…"
-0.78
..."
-0.77
."
-0.76
.”
-0.74
POSITIVE LOGITS
Савезне
1.07
:✨
0.87
Personensuche
0.85
<bos>
0.85
autorytatywna
0.81
ніципа
0.79
twimg
0.76
Roskov
0.75
Numerade
0.73
Administrativna
0.72
Activations Density 0.081%