INDEX
Explanations
mathematical expressions or notations
Numbers immediately after numbers
listing items numerically
New Auto-Interp
Negative Logits
<eos>
-0.85
↵↵
-0.73
↵
-0.70
↵↵↵
-0.69
,
-0.58
’
-0.55
par
-0.54
pal
-0.54
”.
-0.52
pol
-0.51
POSITIVE LOGITS
abestanden
0.82
་་
0.79
setcounter
0.78
Sodom
0.77
ovp
0.76
getItemId
0.74
Wittgenstein
0.73
Reſ
0.73
Etrus
0.73
Anſ
0.71
Activations Density 0.003%