INDEX
    Explanations

    instances of the phrase "I am."

    New Auto-Interp
    Negative Logits
    stants
    -0.17
    INC
    -0.16
    uhl
    -0.16
    ignKey
    -0.16
    inde
    -0.15
    INCT
    -0.15
    batim
    -0.15
    amba
    -0.15
     Uz
    -0.14
    cé
    -0.14
    POSITIVE LOGITS
     buflen
    0.19
    \API
    0.15
    isor
    0.15
    ç¢İ
    0.14
    ieber
    0.14
    heed
    0.14
     QUERY
    0.14
    atha
    0.13
    olate
    0.13
    mah
    0.13
    Act Density 0.004%

    No Known Activations