Explanations
names of individuals
names of people and their titles or roles in a professional context
Negative Logits
partying       -0.67
puppies        -0.63
physically     -0.59
handshake      -0.58
stricken       -0.56
transitioning  -0.54
Classic        -0.54
puppy          -0.54
nightly        -0.54
literal        -0.54
Positive Logits
Shapiro        0.88
Cheong         0.86
Friedman       0.82
Schwartz       0.81
Rao            0.81
Cohen          0.79
Rosenthal      0.79
Krish          0.78
jit            0.77
Levin          0.76
Activations Density: 0.464%