INDEX
    Explanations

    the name "Jeff" with a high level of activation

    mentions of the name "Jeff"

    Negative Logits
    Token           Logit
    " womb"         -0.79
    "xual"          -0.74
    "velt"          -0.67
    "ktop"          -0.63
    "ãĥ´"           -0.62
    "pmwiki"        -0.62
    " Parenthood"   -0.61
    " traged"       -0.61
    "ãĥķãĤ©"        -0.61
    " IMAGES"       -0.61
    Positive Logits
    Token           Logit
    "reys"          1.44
    "ery"           1.33
    "eries"         1.30
    "rey"           1.29
    "erey"          1.24
    "erson"         1.18
    " Bezos"        1.16
    "ress"          1.02
    "ries"          0.99
    "isher"         0.96
    Activation Density: 0.033%

    No Known Activations