INDEX
    Explanations

    quotes and statements from discussions or debates, particularly related to controversial or sensitive topics such as politics

    New Auto-Interp
    Negative Logits
    brance
    -0.74
    ghai
    -0.73
     upstream
    -0.71
    Torrent
    -0.69
    soDeliveryDate
    -0.69
     Maiden
    -0.68
     deforestation
    -0.66
    士
    -0.65
     mortality
    -0.65
    wings
    -0.65
    POSITIVE LOGITS
     rhet
    1.05
     sarcast
    1.00
     applause
    0.99
     incred
    0.98
     rebutt
    0.97
     Kimmel
    0.97
     moderator
    0.95
     chuck
    0.94
     laughter
    0.91
     condesc
    0.91
    Act Density 0.505%

    No Known Activations