INDEX
    Explanations

    proper nouns, particularly names and titles

    New Auto-Interp
    Negative Logits
    ingleton
    -0.19
    Ñīин
    -0.18
    зв
    -0.17
    uum
    -0.16
    OKIE
    -0.15
    /WebAPI
    -0.15
    AttributeValue
    -0.15
    ANGO
    -0.15
    .Guna
    -0.15
    ÅĻi
    -0.14
    POSITIVE LOGITS
    eler
    0.17
     WS
    0.17
    ny
    0.16
    WS
    0.15
     Neal
    0.15
    ruk
    0.15
    isan
    0.14
    Äı
    0.14
    zz
    0.14
    GLOBALS
    0.14
    Act Density 0.009%

    No Known Activations