INDEX
    Explanations

    mentions of hair-related terms like "hair salon," "salon," and descriptions of hair

    New Auto-Interp
    Negative Logits
     bascul
    -0.51
     ché
    -0.51
    Dawg
    -0.51
     Pockets
    -0.50
     cushi
    -0.48
    mnop
    -0.48
     Stretcher
    -0.47
     Assorted
    -0.47
    Ferdin
    -0.47
     Wrench
    -0.46
    POSITIVE LOGITS
     hair
    1.41
    Hair
    1.32
    hair
    1.29
     Hair
    1.29
     HAIR
    1.23
    HAIR
    1.08
     hairs
    1.00
    haired
    0.99
     haired
    0.92
    hairs
    0.87
    Act Density 0.078%

    No Known Activations