INDEX
Explanations
references to dates and possibly numbers, particularly in specific contexts related to time
Numbers followed by counters or units
numbers and dates
New Auto-Interp
Negative Logits
1
-0.95
2
-0.84
0
-0.82
3
-0.81
4
-0.78
5
-0.78
6
-0.74
7
-0.73
9
-0.72
8
-0.69
POSITIVE LOGITS
eighty
1.08
seventy
1.08
forty
1.07
ninety
1.07
sixty
1.06
fifty
1.03
twenty
1.02
thirty
0.98
nineteen
0.96
eighteen
0.95
Activations Density 0.167%