INDEX
    Explanations

    phrases that involve asking for or offering assistance

    expressions and variations of the word "help."

    New Auto-Interp
    Negative Logits
    cms
    -0.70
    thouse
    -0.69
    ivo
    -0.69
    aign
    -0.67
    rained
    -0.66
    ires
    -0.66
    ient
    -0.65
    lot
    -0.62
    fest
    -0.61
    Quantity
    -0.61
    POSITIVE LOGITS
     help
    3.39
     Help
    2.25
     assistance
    2.18
     HELP
    2.17
     aid
    2.14
    help
    2.10
     assist
    2.03
    Help
    1.87
     helps
    1.62
     helping
    1.56
    Act Density 0.038%

    No Known Activations