INDEX
    Explanations

    concerns and fears related to social acceptance and identity

    New Auto-Interp
    Negative Logits
    ê
    -0.16
    ober
    -0.14
    ents
    -0.14
    %X
    -0.14
     ê°
    -0.14
    liest
    -0.14
     deb
    -0.13
    iaux
    -0.13
    ä¾
    -0.13
    716
    -0.13
    POSITIVE LOGITS
    γμα
    0.17
    fcn
    0.15
    款
    0.15
    andre
    0.14
    анка
    0.14
    orda
    0.13
    igrams
    0.13
     TOO
    0.13
    styl
    0.13
    aturdays
    0.13
    Act Density 0.153%

    No Known Activations