INDEX
    Explanations

    phrases indicating a call to action or directive to do something

    New Auto-Interp
    Negative Logits
     advertised
    -0.69
    eers
    -0.68
    Cong
    -0.65
     david
    -0.65
    loo
    -0.64
    iege
    -0.63
    nesses
    -0.63
     agre
    -0.62
    tem
    -0.61
     brill
    -0.60
    POSITIVE LOGITS
    aways
    1.23
     advantage
    0.98
    overs
    0.97
     aback
    0.93
     heed
    0.84
    away
    0.79
    OVER
    0.78
    YR
    0.78
    autions
    0.77
    ume
    0.77
    Act Density 0.105%

    No Known Activations