INDEX
Explanations
hairstyles and related terms
terms related to hairstyling and haircare
New Auto-Interp
Negative Logits
ourke
-0.76
igm
-0.69
Quentin
-0.68
aer
-0.66
Journalism
-0.66
Ferdinand
-0.65
Gutenberg
-0.64
Luc
-0.63
Sly
-0.62
encia
-0.61
POSITIVE LOGITS
hairst
1.24
wig
1.10
haircut
1.08
scalp
1.06
hair
0.94
dress
0.90
Hair
0.87
shampoo
0.83
lasses
0.82
foll
0.82
Activations Density 0.015%