INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     arşiv
    -0.07
    _blocking
    -0.07
     ------>
    -0.07
    navigationBar
    -0.07
     camera
    -0.07
     arkadaş
    -0.07
    ProductId
    -0.06
     участие
    -0.06
     semanas
    -0.06
    isiyle
    -0.06
    POSITIVE LOGITS
     nonsense
    0.13
     junk
    0.13
     Junk
    0.10
    onsense
    0.09
     nons
    0.08
     bogus
    0.08
    unk
    0.07
     kes
    0.07
    bec
    0.07
     Funk
    0.07
    Act Density 0.006%

    No Known Activations