INDEX
    Explanations

    occurrences of years in the text

    New Auto-Interp
    Negative Logits
    ambi
    -0.16
    linger
    -0.16
    rror
    -0.15
    stock
    -0.15
    iros
    -0.14
    malink
    -0.14
    anı
    -0.13
     correct
    -0.13
    ningar
    -0.13
    elix
    -0.13
    POSITIVE LOGITS
    rome
    0.15
    ?action
    0.15
    ãĥªãĤ¹
    0.15
    idos
    0.14
    sov
    0.14
    ispers
    0.14
    CharArray
    0.14
     Gib
    0.14
    ILLISE
    0.14
    iswa
    0.13
    Act Density 0.044%

    No Known Activations