INDEX
Explanations
specific dates or time references
references to specific dates or timelines
Negative Logits
raq           -0.91
ided          -0.87
estern        -0.81
ailability    -0.80
obe           -0.77
olulu         -0.77
atures        -0.76
oyd           -0.75
vernment      -0.75
uo            -0.73
Positive Logits
dates          0.87
Dates          0.85
date           0.83
dates          0.80
enance         0.73
TBD            0.69
Countdown      0.69
buggy          0.68
date           0.67
rape           0.66
Activations Density: 0.014%