INDEX
    Explanations

    phrases related to reactions and responses

    phrases indicating reactions or responses to announcements or events

    New Auto-Interp
    Negative Logits
     entrusted
    -0.78
    agate
    -0.64
    hart
    -0.62
    ouf
    -0.62
    ciples
    -0.61
    yrights
    -0.60
     Parameters
    -0.60
    dimension
    -0.60
    emy
    -0.60
    kun
    -0.59
    POSITIVE LOGITS
     disbelief
    1.36
     dismay
    1.35
     incred
    1.34
     cheers
    1.33
     applause
    1.30
     skepticism
    1.28
     silence
    1.28
     laughter
    1.25
     puzz
    1.25
     indifference
    1.25
    Act Density 0.213%

    No Known Activations