INDEX
Explanations
timestamps indicating time periods
references to temporal markers or time-related phrases
New Auto-Interp
Negative Logits
okers
-0.60
ARDS
-0.58
adle
-0.58
entials
-0.57
boro
-0.55
ordes
-0.54
nyder
-0.54
onto
-0.53
angan
-0.52
que
-0.52
POSITIVE LOGITS
month
1.10
week
1.06
year
1.03
Updated
0.98
ditch
0.94
night
0.92
updated
0.88
rites
0.85
ingly
0.83
October
0.82
Activations Density 0.042%