INDEX
    Explanations

    phrases that indicate actions or intentions aimed at providing assistance or support

    New Auto-Interp
    Negative Logits
     amplified
    -0.17
    orca
    -0.16
    imits
    -0.16
    ignet
    -0.15
    chet
    -0.14
    ipop
    -0.14
     brib
    -0.14
    prech
    -0.14
    ÅŁt
    -0.14
    endon
    -0.14
    POSITIVE LOGITS
     help
    0.23
     enable
    0.20
    help
    0.19
     supplement
    0.19
     guarantee
    0.18
     aid
    0.18
     benefit
    0.18
     helps
    0.18
     ensure
    0.17
     support
    0.17
    Act Density 0.206%

    No Known Activations