    Explanations

    apologies or expressions of regret

    Negative Logits
    Token        Logit
    "Ranked"     -1.19
    "tnc"        -1.03
    "arnaev"     -0.95
    "irrel"      -0.95
    "ieth"       -0.91
    "icle"       -0.91
    "krit"       -0.90
    "sports"     -0.89
    "edience"    -0.89
    "buck"       -0.87
    Positive Logits
    Token           Logit
    " sorry"        1.31
    " excuse"       1.29
    "GES"           1.13
    " excuses"      1.01
    "tm"            1.00
    "sorry"         0.99
    "vm"            0.98
    "Sorry"         0.94
    " Customers"    0.92
    "SQL"           0.91
    Activation Density: 0.392%

    No Known Activations