INDEX
Explanations
references to specific events and dates
Months or abbreviations of months
months and state abbreviations
New Auto-Interp
Negative Logits
Whilst
-1.08
Whilst
-1.01
luß
-0.94
whilst
-0.93
enquiry
-0.81
eighteen
-0.81
Amongst
-0.79
poichè
-0.76
fuck
-0.75
seventeen
-0.74
POSITIVE LOGITS
¦
0.84
—
0.83
¦
0.83
0.83
Sept
0.71
0.70
0.67
••
0.64
,’’
0.63
Fig
0.63
Activations Density 0.539%