INDEX
Explanations
specific characters and symbols used in programming or scripting contexts
New Auto-Interp
Negative Logits
<eos>
-0.79
↵
-0.77
(
-0.74
=
-0.69
(
-0.68
=
-0.68
dymyr
-0.68
>(</
-0.66
__(
-0.65
r
-0.65
POSITIVE LOGITS
leſs
1.36
myſelf
1.26
himſelf
1.25
itſelf
1.23
$_"
1.22
Theſe
1.20
ſelves
1.15
Anſ
1.14
neſs
1.14
themſelves
1.13
Activations Density 0.459%