INDEX
    Explanations

    phrases related to requests and confirmations in conversational contexts

    New Auto-Interp
    Negative Logits
    LOPT
    -0.17
    agma
    -0.17
    chg
    -0.16
    essen
    -0.16
    ANTLR
    -0.15
    ÏĥÏĦαÏĥη
    -0.15
    assen
    -0.15
    MUX
    -0.15
    DOG
    -0.14
    _sdk
    -0.14
    POSITIVE LOGITS
     Tw
    0.16
    .tw
    0.15
     Caf
    0.15
    ubs
    0.15
     Rub
    0.15
     tw
    0.14
     Reeves
    0.14
    icari
    0.14
     Robbie
    0.14
     Clan
    0.14
    Act Density 0.039%

    No Known Activations