INDEX
    Explanations

    negative sentiments or expressions

    Negative Logits
     Confederation  -0.74
     Howe           -0.66
    ĨĴ              -0.64
     Borders        -0.64
     Cause          -0.63
    HCR             -0.62
     existence      -0.61
     constitu       -0.60
     congr          -0.60
     Province       -0.60
    Positive Logits
    down            1.17
    happy           1.13
    downs           1.09
    out             1.07
    dash            1.06
    outs            1.05
    starting        1.05
    back            1.05
    hitting         1.05
    backs           1.02
    Activation Density: 0.035%

    No Known Activations