INDEX
    Explanations

    instances related to providing help or assistance

    references to requests for assistance or support

    New Auto-Interp
    Negative Logits
    agues
    -0.73
    Fed
    -0.71
    WARN
    -0.66
    hari
    -0.64
    mails
    -0.64
    mAh
    -0.63
    fighters
    -0.63
     Sins
    -0.62
    warn
    -0.62
    Quotes
    -0.61
    POSITIVE LOGITS
     regards
    1.27
     regard
    1.12
    stood
    0.87
    standing
    0.86
     respect
    0.82
    draw
    0.81
     impunity
    0.80
     ease
    0.79
     homework
    0.77
     dignity
    0.75
    Act Density 0.065%

    No Known Activations