INDEX
    Explanations

    expressions of warnings or stern advice

    New Auto-Interp
    Negative Logits
    heça
    -0.61
    Cordialement
    -0.58
    Amicalement
    -0.57
    Bibliograf
    -0.57
    RenderAtEndOf
    -0.56
    SharedDtor
    -0.55
    новниш
    -0.52
    ništvo
    -0.51
    pozdrawiam
    -0.50
    ьаж
    -0.50
    POSITIVE LOGITS
     promised
    0.76
     warned
    0.74
    iastes
    0.73
     promise
    0.70
     promises
    0.65
    arschu
    0.64
     instructed
    0.61
     admon
    0.61
     StyleSheet
    0.61
     warnings
    0.60
    Act Density 0.378%

    No Known Activations