INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    madan
    -0.08
     Deel
    -0.08
     gaf
    -0.08
    .cleaned
    -0.08
     unter
    -0.08
     Intro
    -0.08
    .cert
    -0.08
    .Completed
    -0.08
     بده
    -0.07
     Des
    -0.07
    POSITIVE LOGITS
     radix
    0.10
    -base
    0.09
     numeral
    0.08
     perust
    0.08
    achi
    0.08
     આધાર
    0.08
     branching
    0.08
    _BASE
    0.08
     осн
    0.07
     негіз
    0.07
    Act Density 0.034%

    No Known Activations