INDEX
Explanations
dates or mentions of dates
references to dates mentioned in the text
New Auto-Interp
Negative Logits
lins
-0.74
owl
-0.73
ikk
-0.69
rest
-0.66
Gain
-0.66
multipl
-0.64
spectators
-0.64
Cic
-0.63
oul
-0.62
uls
-0.62
POSITIVE LOGITS
dated
4.20
dated
2.29
dating
1.82
Dating
1.69
dates
1.46
date
1.45
dating
1.31
Dates
1.25
date
1.17
Date
1.13
Activations Density 0.008%