INDEX
    Explanations

    phrases related to long-term benefits or consequences

    phrases emphasizing long-term implications or benefits

    New Auto-Interp
    Negative Logits
    Rated
    -0.66
    SK
    -0.65
     Mens
    -0.64
    wb
    -0.63
     Cosponsors
    -0.61
    20439
    -0.60
     Alloy
    -0.60
     tur
    -0.58
    olver
    -0.58
    edIn
    -0.57
    POSITIVE LOGITS
    anwhile
    0.76
    grounds
    0.75
     totality
    0.73
    onies
    0.71
    hedon
    0.70
    nings
    0.70
    stakes
    0.70
    ecycle
    0.69
    ways
    0.67
    forts
    0.67
    Act Density 0.067%

    No Known Activations