INDEX
Explanations
names of individuals, especially those with Eastern European origins
proper nouns, specifically names related to individuals
Negative Logits
venant      -0.82
emouth      -0.79
mingham     -0.73
ocus        -0.71
gement      -0.70
lder        -0.68
itions      -0.68
istries     -0.67
stump       -0.66
crane       -0.65
Positive Logits
lov         1.04
rice        0.85
anian       0.85
daq         0.79
grad        0.78
Pav         0.78
oral        0.75
Yel         0.73
Polly       0.73
anski       0.73
Activations Density 0.021%