INDEX
Explanations
dates in various formats
dates or times mentioned in the text
New Auto-Interp
Negative Logits
opic
-0.73
glim
-0.73
princ
-0.70
predec
-0.67
neighb
-0.66
ingred
-0.66
cumbers
-0.63
bom
-0.62
marqu
-0.60
liner
-0.60
POSITIVE LOGITS
âĶľ
0.86
onwards
0.84
edin
0.83
flower
0.79
2018
0.78
ois
0.75
2015
0.73
2016
0.73
2017
0.72
2014
0.71
Activations Density 0.048%