INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     authority
    -0.08
     fragrance
    -0.08
     autoridade
    -0.08
     вп
    -0.07
     MART
    -0.07
     Beck
    -0.07
     thi
    -0.07
     frustrated
    -0.07
     hopeless
    -0.07
    lessness
    -0.07
    POSITIVE LOGITS
    -return
    0.08
     hardy
    0.08
     Reverse
    0.07
    (or
    0.07
    -valu
    0.07
    ean
    0.07
     Boolean
    0.07
    bita
    0.07
    (Binary
    0.07
    0.07
    Act Density 0.026%

    No Known Activations