INDEX
    Explanations

    instructional or guiding language cues, such as "Let's", "So", and "First"

    instances of introductory phrases or sentences

    New Auto-Interp
    Negative Logits
    steen
    -0.67
     ..."
    -0.63
    morrow
    -0.63
    realDonaldTrump
    -0.62
     constitu
    -0.61
    TRUMP
    -0.60
    Romney
    -0.59
     restores
    -0.58
    onement
    -0.58
    vernment
    -0.57
    POSITIVE LOGITS
     nutshell
    0.93
     Concept
    0.77
     Basics
    0.75
    Introduction
    0.73
    Overview
    0.73
     Overview
    0.72
     originally
    0.71
    Previous
    0.71
     Designed
    0.71
     Introduction
    0.70
    Act Density 0.666%

    No Known Activations