INDEX
Explanations
references to specific time periods in relation to events or occurrences
occurrences of the word "week" to indicate recent events or updates
New Auto-Interp
Negative Logits
vim
-0.67
bler
-0.67
itialized
-0.64
izoph
-0.64
ventus
-0.62
ership
-0.61
Gems
-0.61
gary
-0.61
pend
-0.59
omething
-0.58
POSITIVE LOGITS
days
1.19
night
1.08
afternoon
0.90
DAY
0.78
morning
0.76
evening
0.76
mornings
0.75
ç¥ŀ
0.71
atell
0.71
bush
0.70
Activations Density 0.051%