INDEX
    Explanations

    phrases that express contradictions or contrasting ideas

    Negative Logits
    " Cosponsors"  -0.76
    "chool"        -0.74
    "ogun"         -0.72
    "itsch"        -0.70
    "lass"         -0.69
    "©¶æ"          -0.63
    "psc"          -0.60
    "abase"        -0.60
    "unes"         -0.59
    "cients"       -0.58
    Positive Logits
    " satisfaction"     0.92
    " disappointment"   0.89
    " excitement"       0.89
    " sadness"          0.88
    " certainty"        0.88
    " acknowledgement"  0.87
    " acknowledgment"   0.85
    " happiness"        0.83
    " surprises"        0.83
    " degradation"      0.82
    Act Density 0.311%

    No Known Activations