INDEX
    Explanations

    questions and inquiries

    questions starting with the word "What."

    Negative Logits
    Token           Logit
    " favour"       -0.76
    " reserve"      -0.74
    " favor"        -0.74
    " arch"         -0.71
    " close"        -0.67
    " receiving"    -0.64
    " firm"         -0.64
    " ranking"      -0.64
    " res"          -0.63
    " due"          -0.63
    Positive Logits
    Token           Logit
    "What"          2.81
    "Why"           2.21
    "How"           2.11
    "Who"           2.08
    "WHAT"          2.05
    "what"          2.04
    "Where"         1.86
    "Which"         1.81
    " What"         1.77
    "Whatever"      1.70
    Act Density 0.022%

    No Known Activations
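
The activation density reported above can be read as the share of tokens on which the feature fires. The sketch below is a minimal illustration under that assumption; the dashboard's exact definition (for example, its activation threshold or sample set) is not stated here, and the function name `activation_density` and the sample data are hypothetical.

```python
from typing import Iterable


def activation_density(activations: Iterable[float], threshold: float = 0.0) -> float:
    """Return the percentage of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the fraction of sampled tokens with a
    non-zero (above-threshold) activation, expressed as a percentage.
    """
    values = list(activations)
    if not values:
        return 0.0
    active = sum(1 for a in values if a > threshold)
    return 100.0 * active / len(values)


# Hypothetical data: 100,000 tokens, 22 of which activate the feature.
sample = [0.0] * 99_978 + [1.5] * 22
density = activation_density(sample)
print(f"Act Density {density:.3f}%")
```

Under this reading, a density of 0.022% corresponds to roughly 22 active tokens per 100,000 sampled.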