INDEX
    Explanations

    names of individuals, possibly focusing on surnames

    proper nouns, particularly names and titles

    New Auto-Interp
    Negative Logits
    rish
    -0.77
    orget
    -0.70
    pmwiki
    -0.69
     bearings
    -0.66
    bilt
    -0.66
    ilde
    -0.65
    hattan
    -0.63
     Medal
    -0.62
    sburgh
    -0.61
    puff
    -0.61
    POSITIVE LOGITS
    manship
    0.81
    hao
    0.77
    ciples
    0.75
    conn
    0.73
    ahime
    0.71
    terior
    0.70
    Introduced
    0.70
    RECT
    0.69
    WINDOWS
    0.67
     mustard
    0.65
    Act Density 0.082%

    No Known Activations