INDEX
    Explanations

    instances of first-person singular pronouns and their variations in text

    New Auto-Interp
    Negative Logits
    zcze
    -0.20
    еÑĢж
    -0.15
    dropdown
    -0.15
    IMIT
    -0.14
    ecz
    -0.14
    orus
    -0.14
    lesia
    -0.14
     Nimbus
    -0.14
    imit
    -0.13
    reon
    -0.13
    POSITIVE LOGITS
     want
    0.41
     wants
    0.40
     wanted
    0.33
     wanting
    0.32
    want
    0.31
     Want
    0.30
     Wants
    0.30
     muá»ijn
    0.30
    Want
    0.28
    è¦ģ
    0.27
    Act Density 0.175%

    No Known Activations