INDEX
    Explanations

    Phrases starting with "that" in which the words that follow qualitatively describe something

    Negative Logits
    Token        Logit
    " lts"       -1.11
    " lein"      -1.08
    " aen"       -1.05
    " affor"     -1.05
    " Confu"     -1.04
    " inappro"   -1.03
    " unden"     -1.03
    " Middles"   -1.03
    " parch"     -1.02
    " walter"    -1.01

    Positive Logits
    Token        Logit
    "<bos>"       0.97
    " that"       0.85
    " THAT"       0.79
    "that"        0.77
    "THAT"        0.70
    " That"       0.69
    "That"        0.65
    " dat"        0.62
    " que"        0.59
    "Eso"         0.57

    Act Density 0.342%

    No Known Activations