INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     penas
    -0.08
     система
    -0.08
     gyfr
    -0.07
     SHR
    -0.07
    leka
    -0.07
    -0.07
    killer
    -0.07
    _prog
    -0.07
     spenn
    -0.07
    -0.07
    POSITIVE LOGITS
    udy
    0.08
    יטי
    0.07
     váš
    0.07
    osal
    0.07
    likes
    0.07
     Bott
    0.07
     Sequential
    0.07
     Cater
    0.07
     scenes
    0.06
     Pharm
    0.06
    Act Density 0.088%

    No Known Activations