INDEX
Explanations
days of the week, months, or specific dates mentioned in the text
phrases indicating specific days of the week
New Auto-Interp
Negative Logits
andel
-0.71
ANY
-0.71
GBT
-0.66
netflix
-0.66
agos
-0.66
Limit
-0.65
common
-0.65
erate
-0.64
deal
-0.64
Common
-0.64
POSITIVE LOGITS
pires
1.03
nutshell
0.88
pired
0.70
eatures
0.69
airs
0.65
Iv
0.61
ked
0.61
Flickr
0.59
Responsibility
0.59
ÃĥÃĤ
0.58
Activations Density 0.440%