INDEX
    Explanations

    the names of authors and titles of books

    New Auto-Interp
    Negative Logits
    ebi
    -0.16
    aan
    -0.15
    linger
    -0.15
    abbo
    -0.14
    esso
    -0.13
    ds
    -0.13
    erde
    -0.13
    ìĥģìĿĦ
    -0.13
    ecer
    -0.13
    eper
    -0.13
    POSITIVE LOGITS
    çak
    0.16
    ouv
    0.15
     Hoch
    0.14
    igon
    0.14
    urance
    0.14
     çĻ¾åº¦
    0.14
    ãģıãģł
    0.14
    оком
    0.14
    ogl
    0.14
     Rosenberg
    0.14
    Act Density 0.141%

    No Known Activations