INDEX
Explanations
references to time durations or periods
Negative Logits
456         -0.15
somehow     -0.15
655         -0.15
frey        -0.14
today       -0.14
elli        -0.14
tomorrow    -0.14
817         -0.14
283         -0.14
ou          -0.14
Positive Logits
dozen       0.22
씩          0.19
/month      0.18
десят       0.17
钟          0.16
presso      0.16
료          0.16
ーチ        0.15
hundred     0.15
thousand    0.14
Activations Density 0.051%