INDEX
Explanations
mathematical symbols and notations, particularly involving dollar signs indicating equations or variables
New Auto-Interp
Negative Logits
iſt
-0.96
Reſ
-0.90
Eſ
-0.86
Anſ
-0.85
ſelves
-0.85
ly
-0.83
Inſ
-0.81
ſy
-0.80
••••
-0.80
ſind
-0.78
POSITIVE LOGITS
\}$
1.09
}}$
1.07
}$
1.05
]$
1.04
}]$
1.03
)$
1.03
)}$
1.03
}\}$
1.03
)}$
1.02
]}$
1.02
Activations Density 0.495%