INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     UITextView
    -0.07
     recess
    -0.06
    ugins
    -0.06
    973
    -0.06
    .edu
    -0.06
     Know
    -0.06
     Welfare
    -0.06
     často
    -0.06
    unable
    -0.06
     convex
    -0.06
    POSITIVE LOGITS
    .R
    0.10
    R
    0.09
     R
    0.09
    (r
    0.09
    -R
    0.09
    ER
    0.08
    ,R
    0.08
    EL
    0.07
     r
    0.07
    _R
    0.07
    Act Density 0.099%

    No Known Activations