INDEX
    Explanations

    instances of the word "respond" and its variations, indicating responses to questions or situations

    Negative Logits
    IMER
    -0.16
    opoulos
    -0.16
    abee
    -0.15
    /plain
    -0.15
     biên
    -0.14
    ODEV
    -0.14
    íl
    -0.14
    ourg
    -0.14
    ød
    -0.14
     sẵn
    -0.13
    Positive Logits
    ivate
    0.20
    /respond
    0.16
     with
    0.16
     Tanner
    0.16
    ingly
    0.16
     bằng
    0.16
    ogle
    0.15
    ants
    0.15
     differently
    0.15
     response
    0.15
    Act Density 0.048%

    No Known Activations