INDEX
    Explanations

    names, particularly with specific patterns like "Ar___nold" and "A___r___t"

    frequently occurring suffixes or segments of the word "Arnold."

    Negative Logits
    "nesday"       -0.72
    "ascus"        -0.63
    "ankind"       -0.60
    "glers"        -0.60
    "auga"         -0.59
    "ancial"       -0.58
    " sake"        -0.56
    "rers"         -0.55
    " lifetime"    -0.54
    "inctions"     -0.54
    Positive Logits
    "inian"        0.77
    "itect"        0.75
    "ansas"        0.71
    "agos"         0.71
    "Rah"          0.70
    " Cortex"      0.70
    "INAL"         0.68
    "rary"         0.64
    "Correct"      0.64
    " Refuge"      0.64
    Activation Density: 0.087%

    No Known Activations