INDEX
    Explanations

    instances of the pronoun "it"

    New Auto-Interp
    Negative Logits
    ronym
    -0.17
    vál
    -0.16
    amment
    -0.15
    å³°
    -0.15
    aise
    -0.15
    iales
    -0.14
    aÅŁ
    -0.14
    ót
    -0.14
    çĤ¸
    -0.14
    iosa
    -0.13
    POSITIVE LOGITS
     Its
    0.22
    Its
    0.22
     its
    0.20
    its
    0.17
     own
    0.17
     fich
    0.16
     Own
    0.15
    381
    0.15
     itself
    0.14
    åħ¶
    0.14
    Act Density 0.146%

    No Known Activations