INDEX
    Explanations

    introductory phrases indicating a shift in topic or a new piece of information

    the word "Here" as a signal for an introduction to various topics or lists

    New Auto-Interp
    Negative Logits
    existent
    -0.56
     absor
    -0.56
    Doctors
    -0.55
     Archdemon
    -0.55
     nib
    -0.55
    )].
    -0.55
     Circle
    -0.54
    mong
    -0.54
     systemd
    -0.54
     sshd
    -0.54
    POSITIVE LOGITS
    tical
    1.27
    tics
    1.26
    abouts
    1.20
    tic
    1.10
    fore
    0.82
    upon
    0.82
    after
    0.80
    yers
    0.79
     Comes
    0.77
    itia
    0.77
    Act Density 0.034%

    No Known Activations