INDEX
Explanations
numerical references, particularly related to age or quantities
Negative Logits
es        -0.84
/         -0.82
)         -0.70
er        -0.67
Orrell    -0.66
def       -0.64
“         -0.63
ly        -0.63
︎         -0.62
>         -0.62
Positive Logits
eighty    1.20
seventy   1.14
ninety    1.12
sixty     1.12
fifty     1.07
Theſe     1.06
twenty    1.05
nineteen  1.04
forty     1.04
thirty    1.03
Activations Density: 0.125%