INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    bage
    -0.06
    inosaur
    -0.06
     Manuel
    -0.06
    اصيل
    -0.06
     suppliers
    -0.06
    experimental
    -0.06
    Global
    -0.06
    ظمة
    -0.06
     parfait
    -0.06
    =explode
    -0.06
    POSITIVE LOGITS
    (HWND
    0.07
     какой
    0.06
     depress
    0.06
     région
    0.06
    rapid
    0.06
     VIR
    0.06
    AYOUT
    0.06
     inverted
    0.06
    ","#
    0.06
    _:*
    0.06
    Act Density 0.036%

    No Known Activations