INDEX
    Explanations

    references to beauty and descriptions of aesthetic appeal

    New Auto-Interp
    Negative Logits
    tr
    -0.44
    tis
    -0.43
    pu
    -0.43
    pat
    -0.43
    tu
    -0.42
    onga
    -0.42
    ter
    -0.42
     grom
    -0.42
    gn
    -0.41
    pan
    -0.41
    POSITIVE LOGITS
     beautiful
    1.20
    beautiful
    1.11
     beauty
    1.07
     BEAUTIFUL
    1.03
    Beautiful
    1.03
     Beautiful
    0.99
     BEAUTY
    0.97
    Beauty
    0.96
    beauty
    0.96
     Beauty
    0.94
    Act Density 0.219%

    No Known Activations