INDEX
    Explanations

    references to individuals named Albert, particularly Albert Einstein

    mentions of the name "Albert," particularly in connection with notable figures like Einstein

    New Auto-Interp
    Negative Logits
    BOOK
    -0.85
    ãĥ¯
    -0.84
    ned
    -0.79
    ning
    -0.75
    elcome
    -0.75
    osuke
    -0.74
    pter
    -0.73
    mble
    -0.73
    efully
    -0.73
    ners
    -0.72
    POSITIVE LOGITS
     Einstein
    1.10
     Pu
    0.79
     Calder
    0.78
     Schwe
    0.77
     Heights
    0.71
    onso
    0.70
     Hammond
    0.70
    rand
    0.69
    inas
    0.69
    inates
    0.68
    Act Density 0.025%

    No Known Activations