INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     gyms
    -0.09
    hma
    -0.08
     deport
    -0.08
     collapsed
    -0.07
     zou
    -0.07
    'ob
    -0.07
     cut
    -0.07
     bars
    -0.07
     obe
    -0.07
     gym
    -0.07
    POSITIVE LOGITS
     pristine
    0.10
     spotless
    0.09
     microfiber
    0.09
    0.09
    پاک
    0.08
     Carefully
    0.08
    Often
    0.08
     toner
    0.08
    iftung
    0.08
    עד
    0.08
    Act Density 0.021%

    No Known Activations