INDEX
    Explanations

    terms related to aesthetic qualities or appearances

    references to aesthetic qualities and visual appeal

    New Auto-Interp
    Negative Logits
    etime
    -0.83
    king
    -0.74
    woods
    -0.69
    kers
    -0.69
    idden
    -0.68
    sen
    -0.67
    house
    -0.65
    abad
    -0.64
    quist
    -0.63
    atchewan
    -0.62
    POSITIVE LOGITS
     aesthetic
    1.03
     sensibilities
    0.97
     aesthetics
    0.89
     choices
    0.80
     preferences
    0.80
    hetically
    0.79
     flair
    0.79
     tastes
    0.77
     preference
    0.77
    atically
    0.74
    Act Density 0.011%

    No Known Activations