INDEX
    Explanations

    names of individuals

    proper nouns, specifically names of individuals

    New Auto-Interp
    Negative Logits
     includ
    -0.74
    ccording
    -0.72
     looph
    -0.68
     NETWORK
    -0.67
    EStream
    -0.64
     suspic
    -0.64
     destro
    -0.63
     tiss
    -0.62
    ortium
    -0.61
    benefit
    -0.59
    POSITIVE LOGITS
     alike
    1.05
     respectively
    0.78
    axter
    0.75
     versa
    0.75
    oliath
    0.73
    VB
    0.67
    Ru
    0.64
    ilda
    0.63
    avia
    0.62
     thereof
    0.62
    Act Density 0.361%

    No Known Activations