INDEX
    Explanations

    phrases related to assistance or support

    instances of the word "help" in various contexts

    Negative Logits
    Token             Logit
    " Creed"          -0.70
    " ratio"          -0.67
    " attraction"     -0.66
    " piv"            -0.64
    " Ov"             -0.62
    " division"       -0.61
    " wearing"        -0.61
    " gradient"       -0.60
    " fusion"         -0.59
    " tub"            -0.59
    Positive Logits
    Token             Logit
    "help"            4.10
    "Help"            2.30
    "helps"           1.73
    " Help"           1.70
    " HELP"           1.65
    " help"           1.51
    "support"         1.39
    "guide"           1.23
    " Helpful"        1.15
    " helpful"        1.11
    Activation Density: 0.018%

    No Known Activations