INDEX
    Explanations

    question phrases starting with 'What'

    questions, particularly those starting with "What."

    NEGATIVE LOGITS
    ¿½              -0.71
    interstitial    -0.70
    20439           -0.66
    Recommended     -0.65
    conservancy     -0.64
    ãĥ¼ãĥĨ          -0.63
    rition          -0.62
    udging          -0.60
    Bey             -0.58
    ranging         -0.58
    POSITIVE LOGITS
    ?!              1.21
    ?!"             1.21
    !?              1.18
    !?"             1.13
     did            0.99
     happened       0.99
    ?"              0.98
    ?'              0.95
    ??              0.95
    're             0.94
    Activation Density: 0.082%

    No Known Activations