INDEX
    Negative Logits
    Token           Logit
    " slideshow"    -0.07
    " me"           -0.06
    " Mag"          -0.06
    " spots"        -0.06
    " मजब"          -0.06
    "SC"            -0.06
    " Mos"          -0.06
    " Giz"          -0.06
    " Peg"          -0.06
    " مج"           -0.06

    Positive Logits
    Token           Logit
    " Return"       0.13
    " return"       0.12
    "Return"        0.12
    "return"        0.11
    "Returns"       0.11
    " returns"      0.10
    "-return"       0.10
    ".return"       0.10
    "RETURN"        0.09
    "_return"       0.09

    Activation Density: 0.036%

    No Known Activations