Explanations
instances of blond or blonde hair color
references to people with light-colored hair, particularly blonde individuals
Negative Logits
Token          Logit
Ö¼             -0.91
displayText    -0.83
apego          -0.81
ablishment     -0.76
arters         -0.76
ROR            -0.74
llah           -0.74
GAN            -0.72
arnaev         -0.71
ADRA           -0.70
Positive Logits
Token          Logit
wig            1.23
blond          1.15
blonde         1.09
bombshell      1.08
haired         1.05
hair           1.04
hairst         0.99
bread          0.96
haircut        0.95
bob            0.89
Activations Density 0.020%