INDEX
Explanations
mentions of events or activities that occur in the evening
references to "evening" and related contexts
New Auto-Interp
Negative Logits
elsen
-0.88
ilage
-0.88
cho
-0.79
hee
-0.78
lessly
-0.76
eus
-0.75
ocide
-0.74
control
-0.74
free
-0.72
uilt
-0.72
POSITIVE LOGITS
Evening
0.83
evenings
0.78
evening
0.77
afternoon
0.75
Afric
0.73
gown
0.71
Werewolf
0.70
Shad
0.69
hours
0.68
twilight
0.68
Activations Density 0.010%