INDEX
    Explanations

    proper nouns, specifically surnames

    New Auto-Interp
    Negative Logits
    ò
    -0.85
     cumbers
    -0.84
     conflic
    -0.81
     eleph
    -0.79
     exting
    -0.78
    ñ
    -0.74
    Orderable
    -0.74
    aditional
    -0.74
    ô
    -0.74
     Takeru
    -0.72
    POSITIVE LOGITS
    man
    1.94
    mans
    1.70
    MAN
    1.58
    mann
    1.49
    men
    1.36
    eman
    1.19
    fman
    1.19
    mania
    1.18
    Man
    1.15
    woman
    1.11
    Act Density 0.083%

    No Known Activations