INDEX
    Explanations

    mentions of notable individuals, particularly those with a strong association with the name "Albert", like "Albert Einstein"

    references to the name "Albert," particularly associated with notable figures such as Albert Einstein

    Negative Logits
    "BOOK"          -0.86
    "ãĥ¯"           -0.81
    "ned"           -0.78
    "osuke"         -0.78
    "ning"          -0.74
    "pter"          -0.72
    "efully"        -0.71
    "ners"          -0.69
    "mble"          -0.68
    "ettlement"     -0.68

    Positive Logits
    " Einstein"      1.18
    " Pu"            0.81
    " Calder"        0.76
    " Schwe"         0.75
    "inas"           0.75
    "onso"           0.73
    "inates"         0.72
    "rand"           0.72
    " Hammond"       0.72
    "anus"           0.69
    Activation Density: 0.022%

    No Known Activations