INDEX
    Explanations

    the presence of the pronoun "it" in various contexts

    Negative Logits
    Token        Logit
    "rij"        -0.07
    "iry"        -0.07
    "iph"        -0.07
    "xd"         -0.06
    " p"         -0.06
    "GA"         -0.06
    " lent"      -0.06
    "rnd"        -0.06
    "ango"       -0.06
    "tright"     -0.06

    Positive Logits
    Token        Logit
    "LEM"        0.08
    " Corm"      0.07
    "LEncoder"   0.06
    "ãģŀ"        0.06
    "anus"       0.06
    " dumps"     0.06
    "ÙħÙĦØ©"     0.06
    "ืà¸Ńà¸Ķ"     0.06
    "owy"        0.06
    "lei"        0.06
    Act Density 0.014%

    No Known Activations