INDEX
Explanations
numeric values and mathematical expressions
New Auto-Interp
Negative Logits
Majefty
-1.81
Jefus
-1.79
Efq
-1.79
Monfieur
-1.73
purpoſe
-1.69
Reſ
-1.69
pleaſure
-1.67
Theſe
-1.66
myſelf
-1.65
uſed
-1.62
POSITIVE LOGITS
of
0.82
in
0.71
I
0.70
<bos>
0.70
-
0.69
'
0.68
ma
0.68
E
0.67
/
0.67
"
0.66
Activations Density 1.694%