INDEX
    Explanations

    instances of the word "when" in statements indicating uncertainty or lack of knowledge

    occurrences of the word "when."

    New Auto-Interp
    Negative Logits
    agin
    -0.66
    zzi
    -0.65
    kaya
    -0.61
    rolet
    -0.61
    gur
    -0.59
    bear
    -0.59
    actor
    -0.59
    endish
    -0.59
    elman
    -0.59
    edly
    -0.59
    POSITIVE LOGITS
    soever
    1.31
    irlf
    0.94
    abouts
    0.82
     confronted
    0.79
     faced
    0.73
    IPS
    0.72
    theless
    0.70
     pressed
    0.66
     asked
    0.66
     transitioning
    0.65
    Act Density 0.121%

    No Known Activations