INDEX
    Explanations

    phrases indicating a question or doubt about something

    phrases that express uncertainty or inquiry regarding decisions

    New Auto-Interp
    Negative Logits
    Torrent
    -0.59
    ago
    -0.57
    osp
    -0.56
    que
    -0.55
    ame
    -0.54
    ront
    -0.54
    onew
    -0.53
    stro
    -0.53
    ppo
    -0.53
    +)
    -0.52
    POSITIVE LOGITS
     whether
    3.18
    whether
    2.73
     Whether
    1.95
    Whether
    1.89
     how
    1.34
     regardless
    1.31
     irrespective
    1.27
     why
    1.26
     if
    1.10
     whence
    1.01
    Act Density 0.025%

    No Known Activations