INDEX
    Explanations

    expressions of public health concerns

    Informal conversational cues ("OK", "gonna", "gotta")

    Negative Logits
    )”.    -1.08
    ),”    -1.07
    +:+    -0.96
    ’.”    -0.95
    )”     -0.95
    ”,     -0.94
    ,’”    -0.92
    ”)     -0.91
    ’”     -0.90
    ).”    -0.89
    Positive Logits
     gonna      0.82
     ♪          0.63
     gotta      0.61
     OK         0.58
    gonna       0.57
     outta      0.57
     GONNA      0.55
     uh         0.54
     tonight    0.54
     guys       0.53
    Act Density 0.086%

    No Known Activations