INDEX
    Explanations

    proper nouns and specific identifiers within a context

    New Auto-Interp
    Negative Logits
    ocab
    -0.16
    \Base
    -0.15
    essen
    -0.15
    Terminate
    -0.14
    æļ®
    -0.14
     Transcript
    -0.14
     bulk
    -0.14
    prog
    -0.14
    nw
    -0.13
    brains
    -0.13
    POSITIVE LOGITS
    lesi
    0.16
    bum
    0.15
    Exact
    0.15
    PostBack
    0.14
    utz
    0.14
     Sheldon
    0.14
    pbs
    0.14
    mî
    0.14
    577
    0.14
     e
    0.13
    Act Density 0.004%

    No Known Activations