INDEX
Explanations
people's last names
words that resemble or are related to various job titles or professions
New Auto-Interp
Negative Logits
ccording
-0.91
erest
-0.85
olulu
-0.68
ADRA
-0.67
acebook
-0.66
ournal
-0.64
clud
-0.62
undermin
-0.62
ĸļ
-0.61
cess
-0.61
POSITIVE LOGITS
lein
1.16
meyer
1.05
mann
1.05
jee
0.93
idge
0.93
berger
0.93
bilt
0.93
wald
0.91
stein
0.91
Jr
0.90
Activations Density 0.099%