INDEX
Explanations
dates specifically in the month of November
dates, specifically the occurrences of the word "November" in various contexts
New Auto-Interp
Negative Logits
jriwal
-0.89
ldon
-0.83
pires
-0.83
attendant
-0.75
king
-0.75
gypt
-0.73
gered
-0.72
gency
-0.71
ensed
-0.71
cher
-0.71
POSITIVE LOGITS
е
0.82
1942
0.81
2012
0.81
2014
0.80
1941
0.79
2015
0.78
2016
0.78
flower
0.76
2017
0.76
2010
0.75
Activations Density 0.014%