INDEX
Explanations
days of the week, specifically "Tuesday"
instances of the day of the week, specifically Tuesday
Negative Logits
abet      -0.95
ript      -0.90
luster    -0.83
egal      -0.79
respons   -0.76
ordes     -0.76
onductor  -0.75
keye      -0.72
agos      -0.72
ebook     -0.71
Positive Logits
morning    1.48
afternoon  1.37
night      1.31
evening    1.29
mornings   1.20
Night      1.07
morning    1.06
nights     1.04
evenings   0.97
Morning    0.94
Activations Density: 0.041%