INDEX
    Explanations

    words related to positive feelings or attitudes

    terms related to positivity and positive sentiment

    New Auto-Interp
    Negative Logits
     spo
    -0.69
    opsy
    -0.69
     Brilliant
    -0.67
     Lucia
    -0.67
     Hearts
    -0.66
    loo
    -0.64
     Clarkson
    -0.63
     Wiggins
    -0.60
     Emerson
    -0.59
     STEP
    -0.59
    POSITIVE LOGITS
    itional
    1.59
    itions
    1.48
    itivity
    1.40
    icion
    1.25
    idon
    1.25
    itives
    1.22
    itionally
    1.18
    itiveness
    1.18
    ited
    1.16
    itive
    1.14
    Act Density 0.023%

    No Known Activations