INDEX
    Explanations

    references to historical time periods

    references to specific decades, particularly the 19th century

    New Auto-Interp
    Negative Logits
    paralle
    -0.76
    eway
    -0.73
    acci
    -0.72
    etitive
    -0.72
    spin
    -0.69
    notation
    -0.69
    egal
    -0.68
    ndra
    -0.68
    ringe
    -0.67
    etary
    -0.67
    POSITIVE LOGITS
     1863
    0.83
     1861
    0.80
     1860
    0.75
     1862
    0.74
     1890
    0.73
     1900
    0.70
     1905
    0.70
     1909
    0.69
     1865
    0.69
     1910
    0.68
    Act Density 0.016%

    No Known Activations