INDEX
    Explanations

    proper nouns, particularly names of individuals

    New Auto-Interp
    Negative Logits
     AssemblyCulture
    -0.75
    setVerticalGroup
    -0.62
    ‍♂️
    -0.60
    agic
    -0.58
    tagHelperRunner
    -0.57
     Mr
    -0.56
    AndEndTag
    -0.56
    gridx
    -0.55
    LabelTagHelper
    -0.55
    writeFieldEnd
    -0.53
    POSITIVE LOGITS
     organza
    0.63
    ñora
    0.63
    alee
    0.62
    abelle
    0.62
    lyn
    0.61
    marie
    0.59
    elyn
    0.58
     beaute
    0.58
     Bessel
    0.57
    chelle
    0.57
    Act Density 0.155%

    No Known Activations