INDEX
    Explanations
    Negative Logits
     ISR          -0.07
     laboratory   -0.07
     کوه          -0.07
     lk           -0.07
     grocery      -0.07
     slogan       -0.06
     عملکرد       -0.06
     kosher       -0.06
     chimpan      -0.06
     dispose      -0.06
    Positive Logits
    лаг           0.07
     Including    0.07
    іль           0.07
    OAD           0.07
    pedia         0.06
     Require      0.06
                  0.06
    unda          0.06
    az            0.06
     chó          0.06
    Activation Density 0.002%

    No Known Activations