INDEX
    Explanations

    proper nouns, specifically names

    New Auto-Interp
    Negative Logits
    DockStyle
    -0.83
    ArrowToggle
    -0.73
     initComponents
    -0.65
     تضيفلها
    -0.64
    ########.
    -0.62
     Dorothy
    -0.62
     Hilda
    -0.61
    Susan
    -0.61
    carol
    -0.61
    Dorothy
    -0.61
    POSITIVE LOGITS
     Cade
    0.79
    Cade
    0.73
     Caleb
    0.71
     Luke
    0.69
    __))
    0.69
     Dylan
    0.69
     Kade
    0.67
     Drew
    0.67
     Ethan
    0.66
    Caleb
    0.65
    Act Density 0.452%

    No Known Activations