INDEX
    Explanations

    conversational structures and dialogue markers

    Negative Logits
    Token          Logit
    "Damn"         -0.16
    "akat"         -0.15
    "inis"         -0.14
    "uche"         -0.14
    " seam"        -0.14
    "олее"         -0.14
    "ingers"       -0.13
    "èĤ¥"          -0.13
    " Yep"         -0.13
    " distinct"    -0.13
    Positive Logits
    Token          Logit
    " come"        0.20
    " Listen"      0.20
    " Exc"         0.20
    " Come"        0.20
    "Listen"       0.20
    " You"         0.19
    " look"        0.19
    "listen"       0.19
    " Look"        0.19
    " listen"      0.18
    Activation Density: 0.303%

    No Known Activations