INDEX
Explanations
dates specified in a particular format (for example, "June 20" expressed as "six/20")
specific dates related to events
New Auto-Interp
Negative Logits
orable
-0.73
ensed
-0.63
rw
-0.63
attled
-0.62
akespe
-0.61
wx
-0.60
scissors
-0.59
edIn
-0.58
loves
-0.57
hygiene
-0.56
POSITIVE LOGITS
th
1.30
ths
0.89
TH
0.88
rd
0.85
teenth
0.82
Forth
0.73
âĸĪâĸĪ
0.70
onwards
0.68
eteenth
0.65
2200
0.65
Activations Density 0.089%