INDEX
    Explanations

    first-person pronouns, indicating personal perspective or experience

    Negative Logits
    reeze               -0.18
    eyse                -0.15
    urga                -0.15
    AffineTransform     -0.15
    itol                -0.14
    chwitz              -0.14
    eş                  -0.14
     rám                -0.14
    CommandEvent        -0.14
    udget               -0.14
    Positive Logits
    acs                 0.16
    ofs                 0.15
    321                 0.14
    हन                  0.14
     pau                0.14
    in                  0.14
    ica                 0.14
    so                  0.13
    c                   0.13
    reset               0.13
    Act Density 0.086%

    No Known Activations
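
The Act Density figure above is not defined on this page. A minimal sketch of one common reading, assuming it means the fraction of sampled token positions on which the feature's activation is above zero. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and used only for illustration:

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: density = (positions where the feature fires) / (all positions).
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical sample: 10,000 token positions, 9 of which fire.
sample = [0.0] * 10_000
for i in range(9):
    sample[i * 1000] = 1.5

density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density well below 1% would describe a sparse feature that fires on only a small share of tokens.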