INDEX
    Explanations

    requests for help or assistance in various contexts

    Negative Logits
    "annels"     -0.16
    "Äħż"        -0.15
    "हल"         -0.15
    "iteli"      -0.15
    "ahren"      -0.14
    ".utf"       -0.14
    "iens"       -0.14
    "iore"       -0.14
    "amerate"    -0.14
    "iÄħ"        -0.14
    Positive Logits
    " assistance"    0.29
    " help"          0.28
    " Assistance"    0.21
    " extra"         0.21
    "help"           0.20
    "-extra"         0.20
    " immediate"     0.19
    "extra"          0.19
    "/w"             0.19
    " additional"    0.19
    Activation Density: 0.099%

    No Known Activations