INDEX
    Explanations

    references to style and fashion

    New Auto-Interp
    Negative Logits
    ksam
    -0.18
    imary
    -0.17
    markt
    -0.16
    ofday
    -0.15
     Pazar
    -0.14
    ordes
    -0.14
    éĸĢ
    -0.14
    posting
    -0.14
    stable
    -0.14
    rå
    -0.14
    POSITIVE LOGITS
    rene
    0.20
    lish
    0.19
    list
    0.17
    wart
    0.17
    gia
    0.17
     Sty
    0.17
     styl
    0.17
    lore
    0.17
     Shay
    0.16
    lists
    0.16
    Act Density 0.009%

    No Known Activations