INDEX
Explanations
dates in the format YYYY appearing in the text
references to the year 2011
New Auto-Interp
Negative Logits
phy
-0.84
onite
-0.82
ntil
-0.80
inates
-0.79
imore
-0.75
phia
-0.75
oute
-0.74
odies
-0.72
cumbers
-0.71
olean
-0.71
POSITIVE LOGITS
å¹
1.02
onwards
0.80
-'
0.79
ãĥ¼ãĥĨãĤ£
0.78
20439
0.71
worthiness
0.71
Petraeus
0.69
01
0.68
ford
0.67
aldo
0.67
Activations Density 0.023%