INDEX
    Explanations

    the word "that" in sentences, possibly indicating a focus on specific contexts or conditions

    the word "that" to identify clauses or statements

    Negative Logits
    Token         Logit
    "Guard"       -0.67
    "oses"        -0.66
    "lean"        -0.65
    "gur"         -0.64
    "respect"     -0.63
    "aq"          -0.63
    "le"          -0.60
    "ounding"     -0.59
    "´"           -0.58
    "van"         -0.58
    Positive Logits
    Token         Logit
    " they"       0.92
    "soever"      0.89
    " THEY"       0.84
    " there"      0.83
    " we"         0.81
    " unlike"     0.79
    "*/("         0.78
    " although"   0.78
    " nobody"     0.78
    " it"         0.70
    Activation Density: 0.116%

    No Known Activations