INDEX
    Explanations

    references to literary works and authors

    New Auto-Interp
    Negative Logits
    emey
    -0.19
    iris
    -0.17
    ENAME
    -0.16
    :;↵
    -0.15
    uguay
    -0.15
    ourg
    -0.15
    erland
    -0.15
     tram
    -0.14
     ä¿Ŀ
    -0.14
    ément
    -0.14
    POSITIVE LOGITS
     Shakespeare
    0.30
    akespeare
    0.24
     Ham
    0.21
     Romeo
    0.20
     Juliet
    0.18
     Dane
    0.17
    itus
    0.17
     íĸ
    0.17
    íĸī
    0.17
    peare
    0.17
    Act Density 0.048%

    No Known Activations