INDEX
    Explanations

    verbs related to physical actions and communication efforts

    Negative Logits
     بشر
    -0.17
    ï¸
    -0.15
     pretty
    -0.15
    待
    -0.15
     need
    -0.14
    elve
    -0.14
    ruk
    -0.14
     loud
    -0.14
     needs
    -0.14
    allows
    -0.14
    Positive Logits
     Tried
    0.19
    라도
    0.16
     somehow
    0.16
     unsuccessfully
    0.15
     tried
    0.14
    aget
    0.14
    テル
    0.14
    ternet
    0.14
    ongoose
    0.14
    ريع
    0.14
    Act Density 0.138%

    No Known Activations