    Explanations

    proper nouns, particularly names of people and places

    Negative Logits
    Token         Logit
    afil          -0.16
    CharCode      -0.16
    .seed         -0.15
    NIL           -0.15
    Trap          -0.14
     peru         -0.14
     Kür          -0.14
    yb            -0.14
    eree          -0.14
    æ¨            -0.14

    Positive Logits
    Token         Logit
    uji           0.16
    acha          0.14
    umen          0.14
    ãĤ¶ãĥ¼        0.14
    rl            0.14
    867           0.14
    938           0.13
    950           0.13
    iji           0.13
     Ele          0.13
    Activation Density: 0.032%

    No Known Activations