INDEX
    Explanations

    mentions of clothing, specifically dresses and formal attire

    New Auto-Interp
    Negative Logits
    feito
    -0.07
    erap
    -0.07
    psc
    -0.07
    ocard
    -0.06
    /MIT
    -0.06
    ypse
    -0.06
    hip
    -0.06
    ritt
    -0.06
    olk
    -0.06
     rel
    -0.06
    POSITIVE LOGITS
     worn
    0.08
    MediaType
    0.07
    cratch
    0.07
    èįī
    0.07
    etwork
    0.07
    еÑĤом
    0.07
     bottoms
    0.06
    ETYPE
    0.06
    otel
    0.06
    ÃŃky
    0.06
    Act Density 0.027%

    No Known Activations