INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Pointer
    -0.07
    Review
    -0.07
    gmail
    -0.06
    ุธ
    -0.06
    .liferay
    -0.06
    оды
    -0.06
    Anime
    -0.06
    Runnable
    -0.06
    _POLL
    -0.06
    phies
    -0.06
    POSITIVE LOGITS
     separate
    0.07
    -so
    0.07
    .lv
    0.06
    etherlands
    0.06
    _direct
    0.06
     koc
    0.06
     نش
    0.06
     nhận
    0.06
     σει
    0.06
    \Url
    0.06
    Act Density 0.002%

    No Known Activations