INDEX
Explanations
occurrences of mathematical expressions involving numeric values, particularly those with dollar signs, exponents, and equal signs
New Auto-Interp
Negative Logits
itſelf
-1.40
myſelf
-1.30
Efq
-1.21
Jefus
-1.20
―――――
-1.20
Houſe
-1.20
tvguidetime
-1.19
Monfieur
-1.18
Majefty
-1.16
$_"
-1.16
POSITIVE LOGITS
1.01
$
0.99
$
0.98
.
0.86
<eos>
0.82
$\
0.78
'
0.78
$\
0.73
__))
0.71
↵↵
0.69
Activations Density 0.524%