INDEX
    Explanations

    the word "probably" followed by a verb or adjective

    the word "probably" and its associated concepts of uncertainty or conjecture

    New Auto-Interp
    Negative Logits
    yip
    -0.88
    yss
    -0.72
    lete
    -0.68
    ieu
    -0.67
    ieve
    -0.67
    iya
    -0.66
    eli
    -0.65
    bender
    -0.65
    uctor
    -0.65
    letes
    -0.64
    POSITIVE LOGITS
     wouldn
    1.09
     shouldn
    1.08
     won
    1.02
     wont
    0.97
     overest
    0.94
     underestimate
    0.88
     underest
    0.88
    won
    0.86
     never
    0.83
     weren
    0.83
    Act Density 0.085%

    No Known Activations