Explanations
The abbreviation "min" (minutes) when it denotes a duration of time in an exercise or experimental context.
Negative Logits
Token      Logit
min        -1.36
min        -1.08
Min        -1.02
Min        -0.98
...        -0.93
,          -0.79
minimum    -0.77
"          -0.73
a          -0.73
the        -0.71
Positive Logits
Token      Logit
Monfieur   1.77
myſelf     1.73
Efq        1.70
Majefty    1.63
Houſe      1.59
houſe      1.56
Theſe      1.49
Jefus      1.46
becauſe    1.46
itſelf     1.45
Activations Density 0.720%
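
The density figure above can be read as the share of sampled token positions on which the feature fires. The page does not state how it is computed, so the following is only a minimal sketch under that assumption: the function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and chosen for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of sampled tokens on which the
    feature is active. The dashboard itself does not define the term.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical sample: 1000 token positions, 7 of which activate the feature.
sample = [0.0] * 993 + [0.4, 1.2, 0.1, 2.5, 0.9, 0.3, 3.1]
density = activation_density(sample)
print(f"{density:.3%}")  # formatted as a percentage, like the dashboard figure
```

A strictly positive threshold is used so that exact zeros, which are typical for sparse features, do not count as firing. A real dashboard may use a different cutoff or sampling scheme.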