INDEX
    Explanations

    phrases related to consequences or outcomes

    phrases indicating concern or implications for various groups and audiences

    New Auto-Interp
    Negative Logits
    hess
    -0.77
    è¦ļéĨĴ
    -0.77
    qi
    -0.72
    ngth
    -0.69
    ohn
    -0.67
    æŃ¦
    -0.67
    aukee
    -0.65
    DAQ
    -0.64
    hao
    -0.64
    inv
    -0.64
    POSITIVE LOGITS
     anyone
    1.11
     us
    1.07
     sure
    1.06
     anybody
    1.05
    gotten
    1.00
     me
    0.99
     everyone
    0.98
     whoever
    0.98
     everybody
    0.95
     starters
    0.94
    Act Density 0.204%

    No Known Activations