INDEX
Explanations
years or decades mentioned in the mid-18th to 19th century
historical years and dates mentioned in the text
New Auto-Interp
Negative Logits
paralle
-0.83
gotten
-0.80
etitive
-0.75
colo
-0.72
itivity
-0.71
acci
-0.70
notation
-0.70
abol
-0.70
atom
-0.67
arella
-0.67
POSITIVE LOGITS
1863
0.83
sie
0.82
1862
0.80
1860
0.79
ãĤ¼ãĤ¦ãĤ¹
0.78
1850
0.78
ILCS
0.77
1861
0.77
hrs
0.75
1870
0.75
Activations Density 0.015%