INDEX
Explanations
occurrences of the word "week" in the text
references to time periods, specifically "this week."
New Auto-Interp
Negative Logits
bler
-0.68
itialized
-0.65
Control
-0.65
erate
-0.64
ership
-0.64
ospel
-0.63
ggle
-0.62
Compan
-0.62
ventus
-0.61
gery
-0.60
POSITIVE LOGITS
night
0.96
afternoon
0.93
days
0.92
evening
0.81
morning
0.77
flower
0.77
night
0.73
marked
0.72
ada
0.70
atell
0.69
Activations Density 0.058%