INDEX
    Explanations

    expressions of apology or regret

    New Auto-Interp
    Negative Logits
    Werde
    -0.32
     cupboards
    -0.31
    listdir
    -0.31
    sweise
    -0.31
    XtraBars
    -0.31
    からです
    -0.31
    -0.31
     Initiatives
    -0.31
     prefeitura
    -0.31
     villaggio
    -0.31
    POSITIVE LOGITS
     sorry
    1.30
    sorry
    1.25
     SORRY
    1.22
     Sorry
    1.21
    Sorry
    1.20
     apologize
    1.01
     apologies
    0.95
     apologise
    0.95
     apologized
    0.94
     apology
    0.93
    Act Density 0.119%

    No Known Activations