INDEX
Explanations
female names
words that contain suffixes related to professions or roles
New Auto-Interp
Negative Logits
rador
-0.93
acea
-0.80
imated
-0.78
rosc
-0.75
istani
-0.75
creen
-0.74
ebin
-0.73
arily
-0.71
etheless
-0.70
cellaneous
-0.70
POSITIVE LOGITS
Spears
0.96
Ange
0.93
Jol
0.90
Jenner
0.90
Ambro
0.89
Sturgeon
0.85
Strauss
0.84
Ferr
0.84
Trog
0.84
Griffith
0.83
Activations Density 0.102%