INDEX
    Explanations

    the first-person singular pronoun "I"

    New Auto-Interp
    Negative Logits
     Theſe
    -0.77
     Raub
    -0.71
    arrière
    -0.70
     Beſ
    -0.67
     geslacht
    -0.64
    ่านั้น
    -0.64
     Staub
    -0.64
    ämä
    -0.64
     wach
    -0.64
    nson
    -0.63
    POSITIVE LOGITS
     I
    1.45
    I
    1.32
    iI
    0.99
     i
    0.94
    IOUtils
    0.86
    𝗜
    0.86
    pI
    0.86
    aI
    0.85
    आई
    0.84
    𝐼
    0.82
    Act Density 0.163%

    No Known Activations