INDEX
    Explanations

    mentions of fashion, clothing, and social issues related to wealth disparity

    New Auto-Interp
    Negative Logits
     både
    -0.15
    oute
    -0.15
    ainty
    -0.14
    оÑĥ
    -0.14
    .ms
    -0.14
    ota
    -0.14
    MORE
    -0.14
    yro
    -0.14
    esy
    -0.14
     sogar
    -0.14
    POSITIVE LOGITS
     nothing
    0.18
    nothing
    0.18
     occasional
    0.17
     immediate
    0.15
     thôi
    0.14
    SizeMode
    0.14
    Nothing
    0.14
     occasionally
    0.14
    ãĥ³ãĥģ
    0.14
     Nothing
    0.14
    Act Density 0.169%

    No Known Activations