INDEX
Explanations
specific days of the week
references to specific days of the week
New Auto-Interp
Negative Logits
allo
-0.88
kef
-0.82
abet
-0.81
umbn
-0.81
Adds
-0.80
onductor
-0.78
agos
-0.75
nih
-0.74
ris
-0.72
raltar
-0.71
POSITIVE LOGITS
nights
1.19
mornings
1.12
evenings
1.09
afternoon
1.07
morning
1.04
days
1.02
night
1.01
evening
1.00
DAY
0.96
Night
0.93
Activations Density 0.092%