INDEX
    Explanations

    instances of the word "when"

    instances of the word "when."

    New Auto-Interp
    Negative Logits
    agin
    -0.76
    gan
    -0.72
    rolet
    -0.71
    zzi
    -0.70
    yan
    -0.65
    gem
    -0.65
    hole
    -0.64
    idan
    -0.64
    ictive
    -0.64
    aking
    -0.63
    POSITIVE LOGITS
    soever
    1.39
     confronted
    0.82
    irlf
    0.81
     faced
    0.80
     contrasted
    0.76
     compared
    0.76
     comparing
    0.74
    ŃĶ
    0.68
    ©¶æ
    0.68
     pressed
    0.67
    Act Density 0.122%

    No Known Activations