INDEX
    Explanations

    mentions of public speaking events or interviews

    the word "at" representing various contexts of location or time

    New Auto-Interp
    Negative Logits
    irez
    -0.72
    usercontent
    -0.69
    uay
    -0.67
    activate
    -0.65
    ister
    -0.63
    REDACTED
    -0.59
    alties
    -0.59
    uploads
    -0.58
    escent
    -0.58
     survives
    -0.58
    POSITIVE LOGITS
     length
    1.02
     conferences
    0.96
     least
    0.91
     CES
    0.89
     SX
    0.88
     rallies
    0.87
     halftime
    0.85
     TED
    0.84
     CP
    0.84
     Cannes
    0.81
    Act Density 0.107%

    No Known Activations