INDEX
    Explanations

    numeric references to years, particularly those in the 1800s

    New Auto-Interp
    Negative Logits
    foy
    -0.15
    okus
    -0.15
    erland
    -0.15
    seau
    -0.15
    busters
    -0.15
    pery
    -0.15
    adesh
    -0.14
    неÑĢг
    -0.13
    KeyPressed
    -0.13
     Evil
    -0.13
    POSITIVE LOGITS
    enco
    0.16
     Riv
    0.15
    ied
    0.15
    orta
    0.15
    har
    0.14
    rose
    0.14
    olare
    0.14
     Rivers
    0.14
    peat
    0.14
    erved
    0.14
    Act Density 0.010%

    No Known Activations