INDEX
    Explanations

    discussions about gender and representation in the arts

    New Auto-Interp
    Negative Logits
    ron
    -0.08
    sky
    -0.07
    lius
    -0.07
    OTOR
    -0.06
    omor
    -0.06
    erin
    -0.06
    (ConfigurationManager
    -0.06
    él
    -0.06
    ÙİÙĤ
    -0.06
    TEX
    -0.06
    POSITIVE LOGITS
     unlike
    0.12
     unless
    0.08
    ibri
    0.07
     like
    0.07
     Unlike
    0.07
    åıĬåħ¶
    0.07
    ikat
    0.07
    eca
    0.07
    Unlike
    0.07
    δοÏĤ
    0.07
    Act Density 0.037%

    No Known Activations