INDEX
    Explanations

    occurrences of the letter 'a' or 'i' as single characters

    Negative Logits
    finity    -0.07
    LING      -0.07
    å£        -0.07
    ibus      -0.06
    dob       -0.06
    ract      -0.06
    ury       -0.06
    qv        -0.06
    het       -0.06
    antan     -0.06
    Positive Logits
    ãĤ¤ãĤ¯      0.07
    äº          0.07
    TA          0.06
    Pk          0.06
    .EventType  0.06
    ewire       0.06
    adece       0.06
     Ont        0.06
    .mixin      0.06
    è¯Ŀ         0.06
    Activation Density 0.058%

    No Known Activations