INDEX
Explanations
mathematical notation, specifically related to dimensionality and dimensional bounds
New Auto-Interp
Negative Logits
Picchu
-0.70
ValueStyle
-0.69
']
-0.68
"',
-0.67
skri
-0.67
'}
-0.67
ing
-0.66
^{(-0.65
μην
-0.64
Bain
-0.64
POSITIVE LOGITS
niggas
0.77
fucks
0.76
Pohl
0.76
̉
0.72
pulsante
0.70
0.69
Bernhard
0.68
стихи
0.68
pylint
0.67
yā
0.67
Activations Density 0.015%