INDEX
    Explanations

    proper nouns, particularly names of individuals

    New Auto-Interp
    Negative Logits
     itſelf
    -1.01
     pleaſure
    -0.94
     ſmall
    -0.90
     myſelf
    -0.87
     Anſ
    -0.83
     ſeveral
    -0.81
     Conſ
    -0.80
     Reſ
    -0.80
     Diſ
    -0.79
     Monfieur
    -0.78
    POSITIVE LOGITS
     Edward
    0.76
     George
    0.74
     Charles
    0.74
    djangoproject
    0.72
     James
    0.70
    ("%.
    0.70
     Robert
    0.69
     Henry
    0.69
     getopt
    0.67
    aarrggbb
    0.67
    Act Density 0.898%

    No Known Activations