INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     rog
    -0.08
    _visit
    -0.07
    target
    -0.06
    (ent
    -0.06
    uname
    -0.06
    obel
    -0.06
     minib
    -0.06
    ivic
    -0.06
     beden
    -0.06
    Unary
    -0.06
    POSITIVE LOGITS
     neměl
    0.07
    [Boolean
    0.07
    ={!
    0.07
     Moment
    0.06
     ${(
    0.06
    recommend
    0.06
     important
    0.06
    /mp
    0.06
    .office
    0.06
    _LARGE
    0.06
    Act Density 0.006%

    No Known Activations