INDEX
    Explanations

    instances of blond or blonde hair color

    references to people with light-colored hair, particularly blonde individuals

    New Auto-Interp
    Negative Logits
    Ö¼
    -0.91
    displayText
    -0.83
    apego
    -0.81
    ablishment
    -0.76
    arters
    -0.76
    ROR
    -0.74
    llah
    -0.74
    GAN
    -0.72
    arnaev
    -0.71
    ADRA
    -0.70
    POSITIVE LOGITS
     wig
    1.23
     blond
    1.15
     blonde
    1.09
     bombshell
    1.08
    haired
    1.05
     hair
    1.04
     hairst
    0.99
    bread
    0.96
     haircut
    0.95
     bob
    0.89
    Act Density 0.020%

    No Known Activations