INDEX
    Explanations

    phrases indicating uncertainty or hesitation

    expressions of uncertainty or indecisiveness

    New Auto-Interp
    Negative Logits
    %]
    -0.68
    ufact
    -0.65
    emonium
    -0.65
    verse
    -0.65
    ãĥ¼ãĥĨãĤ£
    -0.63
    ouble
    -0.62
    gencies
    -0.62
    cano
    -0.62
     exting
    -0.59
    ksh
    -0.59
    POSITIVE LOGITS
     whether
    1.34
     why
    1.34
     how
    1.30
     what
    1.10
     if
    1.09
     exactly
    1.05
     WHY
    1.05
    why
    0.99
     about
    0.94
     HOW
    0.93
    Act Density 0.026%

    No Known Activations