INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     scena
    -0.09
    -0.09
    -0.08
     Neural
    -0.08
     الري
    -0.08
     exceeding
    -0.08
     fashionable
    -0.07
     neural
    -0.07
     Cena
    -0.07
     Maße
    -0.07
    POSITIVE LOGITS
     responded
    0.08
    anser
    0.08
    therapy
    0.07
    .website
    0.07
    lab
    0.07
    itek
    0.07
     Inquiry
    0.07
    107
    0.07
    -owned
    0.07
     interviewed
    0.07
    Act Density 0.023%

    No Known Activations