    Explanations

    apologies and expressions of regret

    expressions of apology and regret

    Negative Logits
    Token           Logit
    '\">'           -0.78
    ' guiActiveUn'  -0.76
    'rones'         -0.75
    ' sightings'    -0.74
    'qi'            -0.73
    ' Farming'      -0.71
    ' mosqu'        -0.71
    'ancies'        -0.71
    'ndum'          -0.71
    'population'    -0.71

    Positive Logits
    Token           Logit
    ' apologized'   2.08
    ' apology'      2.07
    ' apologize'    1.95
    ' apologizing'  1.92
    ' apologise'    1.91
    ' apologies'    1.91
    ' apologised'   1.91
    ' remorse'      1.80
    ' regrets'      1.69
    ' regret'       1.67

    Activation Density: 0.671%

    No Known Activations