INDEX
Explanations
descriptions and characteristics of women
New Auto-Interp
Negative Logits
grandfather
-0.16
grandson
-0.15
grands
-0.14
alim
-0.14
sodom
-0.14
unary
-0.14
мо
-0.14
Gay
-0.14
onest
-0.14
homosexual
-0.14
POSITIVE LOGITS
beautiful
0.30
beauty
0.27
gorgeous
0.26
beaut
0.26
attractive
0.25
knockout
0.25
stunning
0.24
dams
0.24
curves
0.24
hottest
0.23
Activations Density 0.392%