INDEX
Explanations
mentions of specific weeks
occurrences of the word "week" in various contexts
New Auto-Interp
Negative Logits
ggle
-0.69
erate
-0.61
vae
-0.59
gery
-0.59
gary
-0.59
catch
-0.58
bler
-0.57
ries
-0.57
itialized
-0.56
ventus
-0.56
POSITIVE LOGITS
night
0.84
afternoon
0.76
days
0.76
marked
0.72
flower
0.68
nown
0.68
alez
0.66
evening
0.66
iphany
0.64
bush
0.63
Activations Density 0.078%