INDEX
Explanations
punctuation marks and certain short-format text features
Negative Logits
Token       Logit
”]          -0.66
<bos>       -0.64
?'          -0.61
"}>         -0.57
localctx    -0.55
thâu        -0.55
”“          -0.55
.”)         -0.54
"")         -0.52
'}>         -0.52
Positive Logits
Token           Logit
détru           0.90
morire          0.87
définiti        0.79
menac           0.79
détruit         0.78
danni           0.78
chré            0.77
quæ             0.77
espirituales    0.76
ennemis         0.76
Activations Density: 1.050%