INDEX
    Explanations

    questions beginning with the word "What"

    New Auto-Interp
    Negative Logits
    xies
    -0.15
     whats
    -0.14
    eres
    -0.14
    inars
    -0.14
    offer
    -0.14
    hits
    -0.14
     forKey
    -0.13
    _acquire
    -0.13
    irected
    -0.13
    åĨĬ
    -0.13
    POSITIVE LOGITS
     Does
    0.27
     does
    0.25
     Are
    0.25
     do
    0.25
     are
    0.21
    soever
    0.21
     Do
    0.20
     Makes
    0.20
     Is
    0.20
     Happ
    0.20
    Act Density 0.053%

    No Known Activations