INDEX
    Explanations

    phrases related to communication and working through issues

    conversational cues indicating necessity or planning for future discussions

    Negative Logits
     "#
    -0.84
    xtap
    -0.74
     WATCHED
    -0.71
     "@
    -0.67
    CONCLUS
    -0.65
    ĺħ
    -0.65
     "{
    -0.62
    endi
    -0.61
    NFL
    -0.61
     ILCS
    -0.61
    Positive Logits
    -"
    1.88
    ..."
    1.71
    â̦"
    1.66
    —"
    1.53
     â̦"
    1.34
    !?"
    1.32
    ?"
    1.30
    â̦."
    1.27
    ?!"
    1.25
    .ãĢį
    1.18
    Act Density 0.388%

    No Known Activations