INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     requesting
    0.79
    を選択
    0.75
     requester
    0.74
     要求
    0.71
    Requ
    0.70
     attains
    0.70
     exprim
    0.69
     undergoes
    0.66
     zaht
    0.66
     auswählen
    0.66
    POSITIVE LOGITS
     help
    2.07
     assist
    1.83
     помочь
    1.81
     offer
    1.79
     helping
    1.77
     hjæl
    1.72
     Help
    1.72
     ayudar
    1.71
    help
    1.70
    帮助
    1.70
    Act Density 1.449%

    No Known Activations