INDEX
    Explanations

    names and terms related to authors and their works

    New Auto-Interp
    Negative Logits
    ustr
    -0.15
    AYER
    -0.14
    AVIS
    -0.14
    beck
    -0.14
    imum
    -0.14
    teri
    -0.13
    bled
    -0.13
    ERY
    -0.13
    nob
    -0.13
    blo
    -0.13
    POSITIVE LOGITS
    656
    0.16
    ñana
    0.16
    ÑĸйÑģÑĮкоÑĹ
    0.14
    áo
    0.14
     ç½
    0.14
    ryan
    0.14
     Batter
    0.14
    odal
    0.13
     pile
    0.13
    ãģĤ
    0.13
    Act Density 0.095%

    No Known Activations