INDEX
Explanations
names of individuals
proper names, specifically individuals' names
Negative Logits
ocial       -0.91
soc         -0.89
leaders     -0.84
vag         -0.83
flies       -0.82
¥µ          -0.82
subject     -0.80
services    -0.80
hospital    -0.79
safety      -0.79
Positive Logits
Snyder      0.97
Rogers      0.97
Martinez    0.96
Rudd        0.94
Reeves      0.94
Fernandez   0.94
Manuel      0.93
Murray      0.93
Morrison    0.92
Torres      0.92
Activations Density 0.123%