INDEX
    Explanations

    conversational elements in dialogue

    New Auto-Interp
    Negative Logits
     …..
    -0.75
    ………..
    -0.73
    ”;
    -0.72
     ….
    -0.71
    ……………………
    -0.69
    ….”
    -0.69
    ……..
    -0.69
    …………
    -0.68
    ……….
    -0.67
    ……”
    -0.66
    POSITIVE LOGITS
     ♪
    0.94
    0.79
     Mm
    0.79
    Mm
    0.66
     gonna
    0.65
    -?
    0.59
     Uh
    0.58
    ...
    0.58
    ...?
    0.56
     Ooh
    0.56
    Act Density 0.051%

    No Known Activations