INDEX
    Explanations

    contractions and words related to decision-making

    positive expressions or affirmations about experiences

    Negative Logits
    Token           Logit
    "But"           -0.76
    "However"       -0.73
    " But"          -0.71
    "ña"            -0.70
    "but"           -0.67
    " However"      -0.66
    "atl"           -0.65
    "afa"           -0.64
    " WATCHED"      -0.64
    "asp"           -0.63
    Positive Logits
    Token            Logit
    " nonetheless"   1.80
    " nevertheless"  1.32
    "etheless"       1.12
    " darn"          0.98
    " awfully"       0.93
    " anyways"       0.91
    " anyway"        0.87
    " certainly"     0.85
    " damn"          0.85
    " gist"          0.83
    Act Density 1.014%

    No Known Activations