INDEX
    Explanations

    user-turn boundaries and intent-signaling tokens that mark a direct question or request in chat messages.

    New Auto-Interp
    Negative Logits
     modes
    0.34
    ော်
    0.33
     milliards
    0.33
     aldehydes
    0.33
     states
    0.32
     civilizations
    0.32
     editions
    0.31
     casks
    0.31
     explanations
    0.30
     regimes
    0.30
    POSITIVE LOGITS
     хотите
    0.46
     хочу
    0.45
     고민
    0.37
     I
    0.37
     неболь
    0.35
     плани
    0.35
     ছোট
    0.35
     можете
    0.35
    Want
    0.35
     County
    0.34
    Act Density 0.641%

    No Known Activations