INDEX
    Explanations

    references to academic articles and their citations

    New Auto-Interp
    Negative Logits
    çķ¥
    -0.15
    _rent
    -0.15
    ANCH
    -0.15
     retina
    -0.14
    baugh
    -0.14
    kek
    -0.14
    luent
    -0.14
    اÙħا
    -0.14
    åħ¸
    -0.14
     Goodman
    -0.14
    POSITIVE LOGITS
    plx
    0.16
     UNS
    0.16
     Eagle
    0.15
    phis
    0.15
    gewater
    0.14
     Naughty
    0.14
    ereum
    0.14
    udson
    0.14
    Official
    0.13
     stripslashes
    0.13
    Act Density 0.006%

    No Known Activations