INDEX
    Explanations

    words related to social embarrassment or shame

    instances of the substring "Emb" followed by various suffixes

    New Auto-Interp
    Negative Logits
    wagen
    -0.84
    ãĥīãĥ©
    -0.76
    creen
    -0.74
    gers
    -0.69
     obsc
    -0.67
     è£ı
    -0.66
    ãĥĥãĥĪ
    -0.66
    heast
    -0.63
    å§«
    -0.63
    culosis
    -0.63
    POSITIVE LOGITS
    arrass
    1.45
    edded
    1.31
    odied
    1.27
    assies
    1.16
    attled
    1.09
    argo
    1.07
    assy
    1.05
    odies
    1.05
    edd
    1.00
    olicy
    0.94
    Act Density 0.059%

    No Known Activations