INDEX
    Explanations

    proper nouns and names, particularly of individuals

    New Auto-Interp
    Negative Logits
    .Atomic
    -0.15
    athi
    -0.15
     yet
    -0.14
     Bab
    -0.14
     Clarkson
    -0.14
    athy
    -0.13
     समर
    -0.13
    iffer
    -0.13
     Blake
    -0.13
    ugg
    -0.13
    POSITIVE LOGITS
    arella
    0.19
     Bian
    0.17
    elan
    0.17
    械
    0.15
    avian
    0.15
     Vaults
    0.15
    afort
    0.15
    esian
    0.15
    antro
    0.15
    igan
    0.15
    Act Density 0.021%

    No Known Activations