Explanations
dates in the month of January
the occurrences of the word "January" and specific dates within that month
Negative Logits
estern      -0.75
diaper      -0.72
phys        -0.72
atic        -0.72
Reviewer    -0.69
ographed    -0.65
inances     -0.64
andem       -0.61
aird        -0.61
inance      -0.60
Positive Logits
January     1.01
2017        0.94
2019        0.94
2015        0.94
January     0.93
ruary       0.93
Feb         0.93
July        0.91
2018        0.90
June        0.89
Activations Density: 0.015%