INDEX
    Negative Logits
     Thür           -0.08
     Tren           -0.08
     Twig           -0.08
    -ма             -0.08
     Bog            -0.07
    ikir            -0.07
    öt              -0.07
     Differential   -0.07
    aggia           -0.07
    _markup         -0.07
    Positive Logits
    но              0.08
    Backend         0.08
     completes      0.08
     brilliance     0.08
     انگی           0.07
    elapsed         0.07
     paix           0.07
                    0.07
     stimul         0.07
    Sigma           0.07
    Act Density 0.008%

    No Known Activations