INDEX
Explanations
dates mentioned in a text
references to specific days and their significance in the context of events
Negative Logits
amily       -0.69
erm         -0.66
eleg        -0.66
unequ       -0.65
err         -0.63
bil         -0.62
woven       -0.61
uffy        -0.61
oplan       -0.61
enriched    -0.61
Positive Logits
aneously           0.85
nings              0.85
BUS                0.79
Fax                0.78
holidays           0.76
actionDate         0.76
holiday            0.73
realDonaldTrump    0.71
DAQ                0.68
slaught            0.68
Activations Density 0.040%