INDEX
    Explanations

    instances of the word "shows" and related verbs indicating presentation or demonstration of information

    New Auto-Interp
    Negative Logits
    hlen
    -0.08
    اسب
    -0.07
    oku
    -0.07
    eman
    -0.07
    uchs
    -0.07
     Pence
    -0.07
    æ²¹
    -0.07
    ordan
    -0.06
    ominator
    -0.06
    brahim
    -0.06
    POSITIVE LOGITS
     hoa
    0.07
     promise
    0.07
    .Deep
    0.07
     itself
    0.07
     little
    0.06
     especially
    0.05
     Little
    0.05
     clearly
    0.05
    pires
    0.05
     cref
    0.05
    Act Density 0.022%

    No Known Activations