INDEX
    Explanations

    expressions of apology and sympathy

    New Auto-Interp
    Negative Logits
     AppCompat
    -0.68
    WireFormatLite
    -0.64
    Démographie
    -0.62
    Xna
    -0.60
    PreferredItem
    -0.60
    σσ
    -0.59
    ={`/
    -0.59
     endblock
    -0.58
    yntaxException
    -0.57
     Se
    -0.57
    POSITIVE LOGITS
     sorry
    1.84
     SORRY
    1.72
     Sorry
    1.66
    Sorry
    1.47
    sorry
    1.44
     Désolé
    1.21
     Sadler
    1.06
     apologised
    1.01
     Sorrow
    1.01
     Pity
    0.96
    Act Density 0.039%

    No Known Activations