INDEX
    Explanations

    words related to unintended or unwanted consequences/results

    New Auto-Interp
    Negative Logits
    ardless
    -0.88
    emark
    -0.88
    eworks
    -0.87
    cius
    -0.82
    uits
    -0.81
    imore
    -0.80
    sheet
    -0.79
    lass
    -0.79
    anguage
    -0.77
    ickr
    -0.76
    POSITIVE LOGITS
     pregnancies
    1.15
     consequence
    1.09
     consequences
    1.00
     complication
    0.98
     pregnancy
    0.97
     Parenthood
    0.90
     unintended
    0.84
     collateral
    0.81
     burden
    0.81
     surprises
    0.81
    Act Density 0.076%

    No Known Activations