INDEX
    Explanations

    pronouns, especially those indicating desire or need for action

    New Auto-Interp
    Negative Logits
    irc
    -0.16
    ÑĨеÑģ
    -0.15
    isher
    -0.14
    iegel
    -0.14
     SOM
    -0.13
    ethe
    -0.13
    ubes
    -0.13
    aptcha
    -0.13
    usz
    -0.13
    ay
    -0.13
    POSITIVE LOGITS
    ãģ£ãģ±
    0.16
    .setViewport
    0.15
    ìĿ´ì§Ģ
    0.15
    kaar
    0.15
     Walters
    0.14
    ÏĦιο
    0.14
     brand
    0.14
    ancode
    0.14
    ÄĽ
    0.14
    ong
    0.13
    Act Density 0.054%

    No Known Activations