INDEX
    Explanations

    proper nouns, specifically focusing on names of individuals and places

    proper nouns, specifically names

    New Auto-Interp
    Negative Logits
     alarm
    -0.71
     bias
    -0.68
     account
    -0.67
     Hal
    -0.62
     cutting
    -0.61
     cut
    -0.59
     adjusting
    -0.59
     estimation
    -0.59
     Im
    -0.59
     research
    -0.59
    POSITIVE LOGITS
    vez
    5.32
    lez
    1.36
    avez
    1.31
    alez
    1.29
    pez
    1.14
    rez
    1.13
    encia
    1.01
     Chavez
    0.97
    urat
    0.96
    irez
    0.95
    Act Density 0.018%

    No Known Activations