INDEX
    Explanations

    mentions of people or entities wearing specific items or accessories

    instances of the word "wearing."

    New Auto-Interp
    Negative Logits
    cffffcc
    -0.81
    =-=-=-=-
    -0.73
     COUR
    -0.69
    later
    -0.68
    MQ
    -0.66
    demon
    -0.66
    deal
    -0.65
    izoph
    -0.65
    estine
    -0.65
    edia
    -0.65
    POSITIVE LOGITS
     apparel
    0.94
     jeans
    0.89
     worn
    0.87
     clothing
    0.84
     shoes
    0.83
     clothes
    0.80
    ables
    0.79
     robes
    0.79
     underwear
    0.79
    ves
    0.78
    Act Density 0.017%

    No Known Activations