INDEX
    Explanations

    questions or desires to know about a variety of topics

    questions or expressions of curiosity

    New Auto-Interp
    Negative Logits
    onding
    -0.71
    pite
    -0.70
    ovie
    -0.66
    interstitial
    -0.66
    ©¶æ¥µ
    -0.63
     projection
    -0.62
    ufact
    -0.62
    twitch
    -0.61
     permitting
    -0.61
    edition
    -0.61
    POSITIVE LOGITS
     WHY
    1.19
     why
    1.14
     how
    1.03
     whether
    1.00
    why
    0.97
     ABOUT
    0.94
     WHERE
    0.93
     answers
    0.90
     HOW
    0.90
     about
    0.89
    Act Density 0.098%

    No Known Activations