INDEX
    Explanations

    references to influential individuals and their contributions

    New Auto-Interp
    Negative Logits
    amma
    -0.17
    (named
    -0.16
    ory
    -0.15
    зÑĥ
    -0.15
     alphabet
    -0.15
     unnamed
    -0.15
     Alphabet
    -0.15
    indow
    -0.15
    ÙĨدÛĮ
    -0.15
     Named
    -0.14
    POSITIVE LOGITS
     mon
    0.40
     sob
    0.36
     handle
    0.35
     term
    0.31
     alias
    0.30
    sob
    0.30
     tag
    0.30
    handle
    0.29
     epith
    0.28
     catch
    0.28
    Act Density 0.151%

    No Known Activations