INDEX
    Explanations

    phrases related to personal responsibility and consequences

    New Auto-Interp
    Negative Logits
    currently
    -0.88
    interstitial
    -0.77
    eeper
    -0.72
    urry
    -0.66
    FIG
    -0.66
    aldi
    -0.64
    SPONSORED
    -0.63
    urrent
    -0.62
    current
    -0.62
    odge
    -0.61
    POSITIVE LOGITS
     yesterday
    0.96
     wrong
    0.88
    terday
    0.84
     ago
    0.84
     nob
    0.81
     incompet
    0.77
     Watergate
    0.76
     countless
    0.74
     fools
    0.74
     greatness
    0.73
    Act Density 0.757%

    No Known Activations