INDEX
Explanations
Phrases related to time, particularly mentions of a specific week.
References to the current week or timeframe in the context of events or updates.
Negative Logits
Token         Logit
Gems          -0.70
bler          -0.69
itialized     -0.67
ership        -0.67
izoph         -0.66
ected         -0.62
vim           -0.62
gary          -0.61
rification    -0.58
emort         -0.57
Positive Logits
Token         Logit
days          1.27
night         1.09
DAY           0.93
afternoon     0.84
mornings      0.80
orrow         0.76
flower        0.74
morning       0.72
artment       0.70
bush          0.70
Activations Density: 0.056%
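To give a sense of scale for the density figure, here is a minimal sketch. It assumes that activation density means the fraction of token positions on which the feature has a nonzero activation. The function name `activation_density`, the `threshold` parameter, and the toy data are hypothetical and only for illustration; they are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" is the share of positions where the feature fires,
    not an average of activation magnitudes.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 10,000 token positions, 6 of which are active.
toy_activations = [0.0] * 9994 + [1.3, 0.4, 2.1, 0.9, 0.2, 3.0]

density = activation_density(toy_activations)
print(f"{density:.3%}")  # prints 0.060%
```

Under this reading, a density of 0.056% corresponds to the feature firing on roughly 5 or 6 out of every 10,000 token positions.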