INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    wind
    -0.08
    -going
    -0.08
     média
    -0.07
     centrifugal
    -0.07
     hazard
    -0.07
    felder
    -0.07
     apresentam
    -0.07
    .cols
    -0.07
    інді
    -0.07
    gående
    -0.07
    POSITIVE LOGITS
    aug
    0.14
    ost
    0.13
    acked
    0.10
    _aug
    0.09
    obb
    0.09
     augmented
    0.09
     domen
    0.08
     Subjects
    0.08
     હતું
    0.08
    OST
    0.08
    Act Density 0.002%

    No Known Activations