INDEX
Explanations
references to people, specifically in a professional or collaborative context
Names connected by "and"
names after 'and'
New Auto-Interp
Negative Logits
Judith
-0.69
Shirley
-0.66
Doris
-0.64
Judith
-0.64
Geraldine
-0.63
Ethel
-0.61
NOPQRST
-0.58
########.
-0.57
Phyllis
-0.57
Wilma
-0.56
POSITIVE LOGITS
Ryan
0.92
Matt
0.86
Kyle
0.81
Ryan
0.79
Adam
0.77
Kyle
0.76
ryan
0.74
Josh
0.72
Matt
0.72
Nate
0.70
Activations Density 0.175%