INDEX
    Explanations

    phrases related to apologies and expressions of regret

    expressions of apology or regret

    New Auto-Interp
    Negative Logits
    kefeller
    -0.82
    fman
    -0.80
    isSpecialOrderable
    -0.73
    eely
    -0.68
    idth
    -0.68
    quickShipAvailable
    -0.67
    ibaba
    -0.66
    Downloadha
    -0.65
    ighters
    -0.64
    女
    -0.64
    POSITIVE LOGITS
     inconvenience
    1.02
     inconven
    1.00
     unres
    0.87
     saddened
    0.84
     insensitive
    0.82
     sorry
    0.82
    sorry
    0.82
     mistake
    0.82
     interruption
    0.80
     offended
    0.80
    Act Density 0.124%

    No Known Activations