INDEX
Explanations
references to mathematical or logical structures, particularly related to equations and proofs
New Auto-Interp
Negative Logits
echa
-0.15
ĥģ
-0.14
%C
-0.14
ÏįÏĢ
-0.14
#",
-0.13
949
-0.12
-na
-0.12
519
-0.12
:</
-0.12
Pace
-0.12
POSITIVE LOGITS
$_
0.36
$
0.35
)$_
0.29
$\
0.27
{$0.24
${0.24
~=
0.23
$/
0.23
'$
0.22
($
0.20
Activations Density 0.057%