INDEX
Explanations
references to days of the week and time-related phrases connected to schedules or routines
New Auto-Interp
Negative Logits
illian
-0.17
akter
-0.15
Apr
-0.15
ythe
-0.14
Jun
-0.14
sep
-0.14
jun
-0.14
Apr
-0.14
Jun
-0.14
mornings
-0.14
POSITIVE LOGITS
Frid
0.22
Wednesday
0.22
Fir
0.22
Mond
0.21
Friday
0.20
Saturday
0.19
Sunday
0.19
Thursday
0.19
unday
0.18
aturday
0.18
Activations Density 0.098%