INDEX
    Explanations

    proper nouns or names

    distinct names or identifiers related to people or entities

    Negative Logits
    " Giles"       -0.81
    " Ange"        -0.78
    " Hur"         -0.73
    " Gat"         -0.72
    "agra"         -0.69
    " Hayward"     -0.69
    " Gale"        -0.69
    " Hastings"    -0.68
    " Chev"        -0.67
    " HAR"         -0.67
    Positive Logits
    "n"            1.52
    "N"            1.47
    "ni"           1.46
    "NN"           1.41
    "Ns"           1.38
    "nn"           1.36
    "nb"           1.36
    "NI"           1.35
    "nr"           1.30
    "NT"           1.30
    Activation Density: 0.585%

    No Known Activations