INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     dort
    -0.09
     τε
    -0.08
     professionelle
    -0.08
    -0.08
    -0.08
    professional
    -0.08
    ਪੀ
    -0.08
     Professional
    -0.08
    Professional
    -0.08
     szak
    -0.08
    POSITIVE LOGITS
     이렇게
    0.09
    Таким
    0.08
     solv
    0.08
    ----------------------------------------------------------------------------------------------------------------
    0.08
     parms
    0.08
    using
    0.08
    -sol
    0.08
    iają
    0.08
    stw
    0.08
     becomes
    0.08
    Act Density 0.132%

    No Known Activations