INDEX
    Explanations

    phrases related to pointing out or emphasizing specific information

    the word "that" in various contexts

    New Auto-Interp
    Negative Logits
    rete
    -0.81
    Course
    -0.75
    icking
    -0.69
    istics
    -0.69
    iscover
    -0.68
    register
    -0.66
    Laughs
    -0.66
    fect
    -0.65
    oleon
    -0.65
    arak
    -0.64
    POSITIVE LOGITS
     justifies
    1.19
     accompanies
    1.19
     contradicts
    1.18
     proves
    1.13
     suggests
    1.11
     undermines
    1.08
     resulted
    1.05
     contradicted
    1.01
     indicates
    1.00
     prevailed
    0.98
    Act Density 0.137%

    No Known Activations