INDEX
    Explanations

    expressions of dialogue and conversational interactions

    New Auto-Interp
    Negative Logits
     basically
    -0.17
     Basically
    -0.17
    Basically
    -0.16
     nay
    -0.15
    Fuck
    -0.15
    éĥ
    -0.15
    fuck
    -0.14
    huge
    -0.14
    gm
    -0.14
     Fuck
    -0.14
    POSITIVE LOGITS
     sorter
    0.21
     queer
    0.18
     arter
    0.18
    iglia
    0.16
     fellows
    0.16
    couldn
    0.15
     positively
    0.15
     Jest
    0.15
    .want
    0.14
     Que
    0.14
    Act Density 0.342%

    No Known Activations