INDEX
    Explanations

    the word "that" in various contexts

    New Auto-Interp
    Negative Logits
    zos
    -0.70
    Tex
    -0.60
    izont
    -0.60
    lin
    -0.59
    oses
    -0.59
    apolis
    -0.58
    Luck
    -0.57
    istors
    -0.56
    ormons
    -0.56
    ono
    -0.56
    POSITIVE LOGITS
     pesky
    1.01
     fateful
    1.01
     elusive
    0.87
    cher
    0.85
     aspect
    0.84
    chers
    0.79
     same
    0.77
    ched
    0.75
    ching
    0.69
     tendency
    0.69
    Act Density 0.166%

    No Known Activations