INDEX
    Explanations

    phrases introducing new information or situations

    the word "that" in various contexts

    New Auto-Interp
    Negative Logits
     Expect
    -0.68
    */
    -0.64
    ggles
    -0.62
    Join
    -0.61
    fts
    -0.60
    iscover
    -0.60
    etermin
    -0.58
     Legislation
    -0.57
     Disorders
    -0.56
    Sit
    -0.55
    POSITIVE LOGITS
     resembled
    1.56
     consisted
    1.46
     amounted
    1.45
     lasted
    1.45
     resulted
    1.41
     culminated
    1.38
     differed
    1.34
     seemed
    1.34
     lacked
    1.31
     hadn
    1.26
    Act Density 0.248%

    No Known Activations