INDEX
    Explanations

    names of places or individuals

    the names of individuals or entities

    Negative Logits
    Token       Logit
    "===="      -0.80
    " cx"       -0.74
    " CY"       -0.74
    " ELE"      -0.73
    " Retro"    -0.72
    " Ao"       -0.71
    " ACS"      -0.71
    " Gy"       -0.70
    " CoC"      -0.69
    "ATES"      -0.69

    Positive Logits
    Token       Logit
    "man"       1.87
    "mans"      1.62
    "mann"      1.57
    "MAN"       1.44
    "men"       1.31
    "eman"      1.23
    "linger"    1.16
    "woman"     1.13
    "heimer"    1.12
    "father"    1.12

    Act Density 0.085%

    No Known Activations