INDEX
    Explanations

    phrases indicating a conditional statement

    the word "that" in various contexts

    Negative Logits
    rior          -0.71
    Merit         -0.70
    emis          -0.69
    rypt          -0.67
    greg          -0.66
    asures        -0.65
    izont         -0.64
    Shar          -0.63
    cycles        -0.63
    IAS           -0.63
    Positive Logits
     happens      1.02
     happened     1.01
     translates   0.95
     pesky        0.92
     occurs       0.90
     mattered     0.90
     entails      0.90
     occurred     0.89
     culminated   0.86
     contradicts  0.84
    Activation Density: 0.177%

    No Known Activations