INDEX
    Explanations

    sentences with varying uses of "it" and "that"

    Negative Logits
    Token          Logit
    "atisfied"     -0.17
    " neod"        -0.15
    "aucoup"       -0.14
    "ób"           -0.14
    "storybook"    -0.14
    "repr"         -0.14
    ":');↵"        -0.14
    "rnek"         -0.14
    "orer"         -0.14
    " Hats"        -0.14
    Positive Logits
    Token          Logit
    " occurred"    0.22
    " occurs"      0.21
    " kind"        0.19
    " occur"       0.19
    "kind"         0.19
    " appears"     0.18
    " *"           0.18
    " Occ"         0.18
    " Freund"      0.17
    " ocor"        0.17
    Activation Density: 0.182%

    No Known Activations