Explanations
dates and days of the week mentioned in a text
mentions of specific days of the week, particularly Thursdays
Negative Logits
umbn                  -0.82
respons               -0.72
igated                -0.70
onductor              -0.69
inctions              -0.67
ection                -0.67
[undecodable bytes]   -0.66
agos                  -0.66
rust                  -0.66
este                  -0.66
Positive Logits
nights                1.15
mornings              1.14
morning               1.08
afternoon             1.00
night                 0.99
evenings              0.99
Night                 0.99
NIGHT                 0.93
evening               0.92
days                  0.91
Activations Density 0.089%