INDEX
    Explanations

    names of individuals

    names of people and their titles or roles in a professional context

    Negative Logits
    " partying"        -0.67
    " puppies"         -0.63
    " physically"      -0.59
    " handshake"       -0.58
    " stricken"        -0.56
    " transitioning"   -0.54
    "Classic"          -0.54
    " puppy"           -0.54
    " nightly"         -0.54
    " literal"         -0.54

    Positive Logits
    " Shapiro"          0.88
    " Cheong"           0.86
    " Friedman"         0.82
    " Schwartz"         0.81
    " Rao"              0.81
    " Cohen"            0.79
    " Rosenthal"        0.79
    " Krish"            0.78
    "jit"               0.77
    " Levin"            0.76
    Activation Density: 0.464%

    No Known Activations