INDEX
    Explanations

    short phrases or questions starting with "What's" or "What is."

    occurrences of the word "what" in various contexts

    New Auto-Interp
    Negative Logits
     horizont
    -0.79
     seiz
    -0.69
    enegger
    -0.67
     Tid
    -0.66
     wink
    -0.64
    fman
    -0.62
     indoor
    -0.60
     impart
    -0.60
    neys
    -0.59
    zn
    -0.59
    POSITIVE LOGITS
    ¬
    1.13
    ı
    1.04
    ¡
    0.99
    ¹
    0.98
    ª
    0.97
    į
    0.96
    IJ
    0.95
    º
    0.95
    ķ
    0.94
    Ĵ
    0.94
    Act Density 0.044%

    No Known Activations