INDEX
    Explanations

    references to a specific individual named Albert, likely referencing Albert Einstein

    New Auto-Interp
    Negative Logits
    osuke
    -0.86
    ned
    -0.85
    BOOK
    -0.78
    ning
    -0.74
    ners
    -0.72
    ãĥ¯
    -0.71
    efully
    -0.70
    glers
    -0.70
    pter
    -0.70
    packing
    -0.69
    POSITIVE LOGITS
     Einstein
    1.13
     Pu
    0.81
     Schwe
    0.81
    rand
    0.80
    onso
    0.79
    inas
    0.79
     Calder
    0.76
     Wenger
    0.71
    inite
    0.69
     Hammond
    0.69
    Act Density 0.024%

    No Known Activations