INDEX
    Explanations

    proper nouns associated with places, institutions, or specific demographics

    Negative Logits
    ipers         -0.15
     Eg           -0.15
    hood          -0.14
     Beard        -0.14
    UFFIX         -0.14
    artment       -0.14
     Tout         -0.14
     Dove         -0.13
    886           -0.13
    stants        -0.13
    Positive Logits
    ilter         0.16
    /*@           0.15
    Feat          0.14
    unpack        0.14
    enu           0.14
    ileo          0.14
    .writeValue   0.13
    iams          0.13
     FileAccess   0.13
    ιÏİ           0.13
    Activation Density: 0.004%

    No Known Activations