INDEX
    Explanations

    phrases related to uncertainty or inquiries about a situation

    phrases questioning authenticity or certainty

    New Auto-Interp
    Negative Logits
    âĿ
    -0.70
    letter
    -0.69
     Nurs
    -0.66
    irez
    -0.63
    wagon
    -0.62
    ulas
    -0.60
    illac
    -0.60
    checks
    -0.59
    bye
    -0.59
    sectional
    -0.58
    POSITIVE LOGITS
     why
    1.10
     whether
    0.98
     WHY
    0.87
     how
    0.86
     justify
    0.76
    why
    0.73
    whether
    0.73
    yx
    0.71
     whereabouts
    0.71
    wing
    0.69
    Act Density 0.049%

    No Known Activations