INDEX
    Explanations

    adjectives and descriptive terms related to behavioral traits or characteristics

    New Auto-Interp
    Negative Logits
    tains
    -0.71
    ocular
    -0.66
    empt
    -0.64
    Clar
    -0.63
    ozo
    -0.63
    hon
    -0.62
    othe
    -0.61
    zyme
    -0.61
    odder
    -0.61
    ioxide
    -0.61
    POSITIVE LOGITS
    luster
    0.89
    nesses
    0.82
     owing
    0.75
    mson
    0.75
     miser
    0.72
     due
    0.72
     blacks
    0.66
     Worse
    0.65
     blight
    0.65
     Pradesh
    0.65
    Act Density 0.165%

    No Known Activations