    Explanations

    references to literary reviews and publications

    Negative Logits
    umo
    -0.16
    aec
    -0.14
     Funk
    -0.14
    licken
    -0.14
    ']!='
    -0.14
    ryo
    -0.14
    aits
    -0.14
    oose
    -0.14
    承
    -0.14
    елич
    -0.13
    Positive Logits
     picks
    0.21
     pick
    0.20
    Pick
    0.20
     starred
    0.20
     Picks
    0.19
     Pick
    0.19
     PICK
    0.18
     Best
    0.16
    SYNC
    0.16
    선
    0.15
    Activation Density 0.022%

    No Known Activations