INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     ịbụ
    -0.09
     DBG
    -0.08
    romo
    -0.08
    يبة
    -0.08
    不上
    -0.08
    endance
    -0.08
     versn
    -0.08
     ọgụ
    -0.08
    ibet
    -0.08
     entènèt
    -0.07
    POSITIVE LOGITS
     wants
    0.08
     चाहता
    0.08
     necesita
    0.07
     Provide
    0.07
    >Please
    0.07
    Define
    0.07
     möchten
    0.07
     want
    0.07
     value
    0.07
     Leit
    0.07
    Act Density 0.006%

    No Known Activations