    Negative Logits
    Token               Logit
    " perf"             -0.08
    " Tut"              -0.07
    "-fe"               -0.07
    " Je"               -0.07
    "jul"               -0.07
    " authoritative"    -0.07
    " fractions"        -0.07
    "_proc"             -0.07
    "(machine"          -0.07
    ".FILL"             -0.07
    Positive Logits
    Token               Logit
    " оке"              0.09
                        0.08
    " kawasan"          0.08
    " confronto"        0.08
    " maga"             0.08
                        0.08
    "usahaan"           0.08
    " শান্ত"              0.08
    " کراچی"            0.08
    " confronting"      0.08
    Act Density 0.004%

    No Known Activations