INDEX
    Explanations

    phrases indicating an explanation or clarification

    phrases that emphasize personal opinion or statement

    New Auto-Interp
    Negative Logits
    "},"
    -0.78
    icipated
    -0.67
    \/\/
    -0.66
    onding
    -0.63
    berus
    -0.63
    osure
    -0.62
    Below
    -0.61
    shaw
    -0.60
    illed
    -0.59
    kefeller
    -0.59
    POSITIVE LOGITS
     seriously
    1.06
     honestly
    1.00
     yeah
    0.96
     REALLY
    0.91
     LOOK
    0.86
     wow
    0.85
     yea
    0.84
     literally
    0.83
    htaking
    0.78
     hey
    0.78
    Act Density 0.029%

    No Known Activations