INDEX
    NEGATIVE LOGITS
    Token          Logit
    " FRONT"       -0.07
    " intake"      -0.06
    "Decimal"      -0.06
    "Twitter"      -0.06
    "Arr"          -0.06
    " їм"          -0.06
    " ======="     -0.06
    " muster"      -0.06
    " menu"        -0.06
    " Twitter"     -0.06
    POSITIVE LOGITS
    Token          Logit
    " studying"    0.07
    " studied"     0.07
    [not shown]    0.07
    "_payments"    0.07
    "ethyl"        0.06
    "rtype"        0.06
    "esser"        0.06
    " Sask"        0.06
    "kening"       0.06
    " изуч"        0.06
    Activation Density: 0.023%

    No Known Activations