INDEX
    Explanations

    specific noun phrases related to actions or situations

    occurrences of the word "the."

    New Auto-Interp
    Negative Logits
     horizont
    -0.66
    anwhile
    -0.63
     surrounds
    -0.58
    ":["
    -0.57
    ãĤ´ãĥ³
    -0.56
     Signed
    -0.56
    rants
    -0.54
    rought
    -0.54
     throats
    -0.54
     Gret
    -0.54
    POSITIVE LOGITS
     impression
    0.82
     leap
    0.75
    sense
    0.72
     distinction
    0.71
     mistake
    0.66
     distinctions
    0.66
     determin
    0.65
     debut
    0.64
     difference
    0.63
    yssey
    0.63
    Act Density 0.119%

    No Known Activations