INDEX
    Explanations

    proper nouns, particularly names

    Negative Logits
    enko     -0.18
    iÄĻ      -0.17
    anca     -0.17
    gart     -0.17
    íĴį      -0.16
    uju      -0.15
    an       -0.15
    u        -0.15
    iyah     -0.15
    est      -0.15
    Positive Logits
    nowled    0.28
    ismet     0.24
    adem      0.23
    ademic    0.23
    erman     0.22
    nowledge  0.22
    robat     0.20
    zept      0.20
    cent      0.20
    47        0.19
    Activation Density 0.011%

    No Known Activations