INDEX
    Explanations

    phrases related to specific situations or events

    instances of the word "when."

    Negative Logits
    Token       Logit
    "agin"      -0.80
    "thal"      -0.66
    "Bas"       -0.63
    "ictive"    -0.63
    "yan"       -0.63
    "Es"        -0.62
    "ha"        -0.62
    "aches"     -0.62
    "gan"       -0.61
    "bas"       -0.61
    Positive Logits
    Token            Logit
    "soever"         1.23
    " asked"         0.82
    " confronted"    0.81
    "irlf"           0.79
    " pressed"       0.78
    " they"          0.76
    " contacted"     0.72
    "*/("            0.71
    " she"           0.71
    " faced"         0.70
    Act Density 0.124%

    No Known Activations