INDEX
    Explanations

    phrases beginning with "Did you" prompting for engagement or response

    questions and statements directed towards the audience or reader

    Negative Logits
    "Connector"     -0.76
    "artifacts"     -0.76
    "heter"         -0.73
    "assemb"        -0.72
    "limits"        -0.69
    " presently"    -0.68
    "yond"          -0.67
    "Rel"           -0.66
    "currently"     -0.63
    "Dialogue"      -0.62
    Positive Logits
    " catch"        0.86
    " typo"         0.86
    " originally"   0.85
    " earlier"      0.83
    " stumble"      0.82
    " mistake"      0.82
    " previously"   0.81
    " mention"      0.79
    " miss"         0.78
    " last"         0.75
    Activation Density: 0.183%

    No Known Activations