INDEX
    Explanations

    words related to apologies and expressions of regret

    instances of apologies and expressions of remorse

    New Auto-Interp
    Negative Logits
    weeney
    -0.80
    arnaev
    -0.76
    marked
    -0.72
    Downloadha
    -0.71
    adj
    -0.70
    estial
    -0.69
    markets
    -0.68
    ther
    -0.67
    tein
    -0.67
    ::::::::
    -0.67
    POSITIVE LOGITS
    giving
    1.01
     unres
    0.99
     apology
    0.90
     apologized
    0.88
     apologize
    0.87
     forgiveness
    0.84
     apologised
    0.82
     apologizing
    0.81
     apologise
    0.78
     apologies
    0.78
    Act Density 0.027%

    No Known Activations