INDEX
Explanations
references to the day of the week "Monday"
instances of the word "Monday."
New Auto-Interp
Negative Logits
ript
-1.17
onductor
-0.96
iframe
-0.87
ordes
-0.87
keye
-0.87
umbn
-0.84
ris
-0.80
egal
-0.79
abet
-0.79
respons
-0.76
POSITIVE LOGITS
morning
1.59
afternoon
1.52
night
1.43
mornings
1.42
evening
1.40
Night
1.26
morning
1.18
nights
1.17
evenings
1.08
Morning
1.05
Activations Density 0.030%