INDEX
    Explanations

    expressions of remorse and apologies

    New Auto-Interp
    Negative Logits
    BeginContext
    -0.62
    clusal
    -0.58
     Italijanski
    -0.55
    urator
    -0.54
    рядок
    -0.54
     userSchema
    -0.54
    arean
    -0.53
     nourished
    -0.52
     conserve
    -0.52
    zzleHttp
    -0.51
    POSITIVE LOGITS
     apologized
    1.33
     apologizing
    1.32
     apology
    1.31
     apologize
    1.22
     apologised
    1.19
     apologies
    1.16
     remorse
    1.11
     apologise
    1.10
     apologe
    1.09
     apolog
    1.03
    Act Density 0.250%

    No Known Activations