INDEX
    Explanations

    words related to people's names, specifically the name "William" with a strong activation value

    mentions of the name "William."

    New Auto-Interp
    Negative Logits
    yrinth
    -0.86
    rador
    -0.78
    ebook
    -0.75
    wrapper
    -0.74
    chio
    -0.74
    displayText
    -0.73
    asio
    -0.71
    ADRA
    -0.71
    ordinate
    -0.70
    hess
    -0.70
    POSITIVE LOGITS
    son
    0.92
    \\\\\\\\
    0.91
    sburg
    0.87
     Randolph
    0.85
    stown
    0.85
     Faul
    0.83
     Hague
    0.83
     Wallace
    0.80
    ette
    0.80
     William
    0.80
    Act Density 0.017%

    No Known Activations