INDEX
Explanations
phrases related to specific times of day, particularly evenings
references to the time of day specifically related to "evening."
Negative Logits
Manip     -0.67
ps        -0.65
NS        -0.64
ibles     -0.61
writ      -0.60
¯         -0.60
cr        -0.59
roid      -0.59
ec        -0.58
Frames    -0.58
Positive Logits
evening     3.61
afternoon   3.05
morning     2.63
night       2.44
evenings    2.39
Evening     2.34
morning     2.01
mornings    1.90
nights      1.72
midday      1.69
Activations Density 0.013%