INDEX
    Explanations

    phrases indicating the beginning of information or a story

    the word "here" used repeatedly in various contexts

    New Auto-Interp
    Negative Logits
     Strait
    -0.64
    )].
    -0.63
    cade
    -0.59
     Heights
    -0.58
     maker
    -0.58
    ONSORED
    -0.58
    culosis
    -0.58
    acci
    -0.56
    amines
    -0.56
     Dur
    -0.55
    POSITIVE LOGITS
    tical
    1.27
    tics
    1.26
    abouts
    1.23
    tic
    1.09
    with
    0.74
    upon
    0.74
    ugal
    0.71
    fires
    0.70
    here
    0.70
    oys
    0.69
    Act Density 0.039%

    No Known Activations