INDEX
    Explanations

    proper nouns, particularly names and references to individuals

    New Auto-Interp
    Negative Logits
     Diſ
    -1.21
     houſe
    -1.20
     Theſe
    -1.20
    ſelf
    -1.16
     becauſe
    -1.15
     itſelf
    -1.15
     myſelf
    -1.12
     Houſe
    -1.10
     Eſ
    -1.09
     ―――――
    -1.09
    POSITIVE LOGITS
     Von
    0.93
     von
    0.90
    Von
    0.82
     Steph
    0.76
    Steph
    0.74
    machus
    0.72
     Ste
    0.71
     Stephens
    0.69
     degré
    0.67
    ccc
    0.67
    Act Density 0.514%

    No Known Activations