INDEX
    Explanations

    words related to visual or sensory evaluation such as "looks", "tastes", "feels"

    descriptions of visual appearances

    Negative Logits
    Token           Logit
    "limit"         -0.73
    "ulla"          -0.69
    "learning"      -0.67
    "ilings"        -0.67
    "upuncture"     -0.67
    "mental"        -0.66
    "trl"           -0.66
    " Osw"          -0.65
    "venient"       -0.64
    "ference"       -0.63
    Positive Logits
    Token           Logit
    " suspic"       0.93
    " like"         0.89
    " awfully"      0.88
    " identical"    0.84
    " strikingly"   0.83
    " blurry"       0.81
    " sleek"        0.80
    " vaguely"      0.79
    " prett"        0.79
    " shiny"        0.78
    Act Density 0.068%

    No Known Activations