Explanations
dates and references to specific events or time periods
Negative Logits
Token        Logit
Halloween    -0.17
Christmas    -0.16
Sept         -0.16
Winter       -0.15
christmas    -0.15
cient        -0.15
åĨ¬ (冬)     -0.15
Oct          -0.15
ateg         -0.14
oct          -0.14
Positive Logits
Token        Logit
May          0.24
May          0.23
ilinear      0.17
amoto        0.17
June         0.16
nard         0.16
-May         0.16
procs        0.16
erk          0.15
oral         0.15
Activations Density 0.028%