INDEX
    Explanations
    Negative Logits
        Token          Logit
        " فشار"        -0.06
        " Unique"      -0.06
        " مطالعه"      -0.06
        "ống"          -0.06
        " aprend"      -0.06
        "adera"        -0.06
        "ịnh"          -0.06
        " emacs"       -0.06
        " Châu"        -0.06
        " solving"     -0.06
    Positive Logits
        Token          Logit
        "(!("          0.07
        "Suggestions"  0.06
        " с"           0.06
        " prepares"    0.06
        " فوتبال"      0.06
        "ær"           0.06
        "_define"      0.06
        "(handles"     0.06
        " erotisk"     0.06
        "056"          0.06
    Activation Density: 0.011%

    No Known Activations