INDEX
    Explanations

    references to beauty pageants and personal experiences related to self-acceptance

    New Auto-Interp
    Negative Logits
    mage
    -0.16
    onda
    -0.16
    иÑĩа
    -0.15
    quier
    -0.15
    allee
    -0.15
    daÅŁ
    -0.15
    绩
    -0.15
    ë¶
    -0.15
    roma
    -0.14
    ithub
    -0.14
    POSITIVE LOGITS
     Miss
    0.51
    Miss
    0.43
     MISS
    0.39
     page
    0.38
     miss
    0.36
     beauty
    0.35
    miss
    0.34
    _miss
    0.32
     Beauty
    0.32
     Page
    0.30
    Act Density 0.021%

    No Known Activations