INDEX
    Explanations

    positive adjectives describing things or experiences

    Negative Logits
    Token         Logit
    "avis"        -0.90
    "PD"          -0.83
    "iph"         -0.80
    "pper"        -0.79
    "unker"       -0.73
    "arers"       -0.72
    "bots"        -0.72
    "consumer"    -0.71
    "idel"        -0.71
    "lay"         -0.70
    Positive Logits
    Token           Logit
    " wonderful"    0.91
    "terday"        0.89
    " Wonderful"    0.80
    " joy"          0.76
    "astically"     0.75
    " surprises"    0.75
    " gracious"     0.75
    " amazing"      0.75
    "NESS"          0.73
    " sounding"     0.72
    Activation Density: 0.012%

    No Known Activations
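
    The activation density reported above is read here as the fraction of token positions on which the feature fires. That reading is an assumption, since the page does not define the term. Under it, the figure could be computed roughly as in the sketch below. The function name, the zero threshold, and the sample data are all hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds threshold.

    Assumption: "density" means the share of tokens with a nonzero
    (above-threshold) activation; the source page does not define it.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: the feature fires on 1 of 10,000 token positions.
sample = [0.0] * 9_999 + [1.7]
print(f"{activation_density(sample):.3%}")  # prints 0.010%
```

    A density of 0.012% would correspond to about 12 active positions per 100,000 tokens under this definition.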