INDEX
    Explanations

    the presence of the pronoun "I"

    New Auto-Interp
    Negative Logits
    iyim
    -0.08
    claimer
    -0.07
    _Tis
    -0.07
    chez
    -0.07
    utzer
    -0.06
    gili
    -0.06
    :async
    -0.06
    raud
    -0.06
    æľĹ
    -0.06
    Ñıн
    -0.06
    POSITIVE LOGITS
    oti
    0.06
     Tiny
    0.06
    assen
    0.06
     tar
    0.06
    ä¸Ķ
    0.05
    opt
    0.05
     Grimm
    0.05
     cur
    0.05
     unk
    0.05
    zen
    0.05
    Act Density 0.000%

    No Known Activations