INDEX
Explanations
punctuation and formatting symbols, particularly in relation to mathematical or scientific notation
New Auto-Interp
Negative Logits
(
-0.81
>(</
-0.80
ecore
-0.77
dymyr
-0.76
<eos>
-0.75
(
-0.75
__(
-0.74
↵
-0.71
Przypisy
-0.68
_
-0.68
POSITIVE LOGITS
myſelf
1.29
Theſe
1.27
himſelf
1.24
leſs
1.21
Anſ
1.19
$_"
1.15
itſelf
1.14
ſelves
1.14
raiſ
1.13
BibitemShut
1.12
Activations Density 0.568%