INDEX
    Explanations
    Negative Logits
     Chang
    -0.09
     blij
    -0.08
     Wandel
    -0.08
    Chang
    -0.08
    ƒ
    -0.08
    دو
    -0.07
     നടത്ത
    -0.07
    اندې
    -0.07
    (single
    -0.07
    .Single
    -0.07
    Positive Logits
     congr
    0.09
     periodic
    0.08
     modulo
    0.08
    Cong
    0.08
    Periodic
    0.08
     channel
    0.07
    irthday
    0.07
    0.07
    _channel
    0.07
    -channel
    0.07
    Act Density 0.015%

    No Known Activations