INDEX
Explanations
dates and days of the week in text
occurrences of the word "on" indicating specific dates or events
Negative Logits
Token       Logit
ynt         -0.72
arat        -0.71
anan        -0.69
$$          -0.68
oler        -0.66
arth        -0.64
utter       -0.62
aturated    -0.61
uum         -0.61
aults       -0.60
Positive Logits
Token       Logit
behalf      1.45
Thursday    1.09
Wednesday   1.05
Monday      1.03
shore       1.02
Friday      1.02
September   1.01
etime       1.01
Tuesday     1.00
erous       0.99
Activations Density: 0.182%