INDEX
    Explanations

    references to specific dates and events

    New Auto-Interp
    Negative Logits
    utherford
    -0.15
    urent
    -0.15
     same
    -0.15
    ress
    -0.15
     DAM
    -0.15
    iform
    -0.14
    .Args
    -0.14
     Dam
    -0.14
    Ìĥ
    -0.14
    ibia
    -0.14
    POSITIVE LOGITS
    ALES
    0.17
    ephy
    0.15
    iye
    0.15
    ı
    0.14
    indh
    0.14
    308
    0.14
    dera
    0.14
    imity
    0.14
    baru
    0.14
     ped
    0.14
    Act Density 0.047%

    No Known Activations