INDEX
    Explanations

    mentions of diversity and inclusive themes

    Negative Logits
    "Gue"               -0.72
    "TemporalType"      -0.72
    "uxxxx"             -0.68
    "contentLoaded"     -0.63
    "oneph"             -0.62
    "bakter"            -0.62
    "StringProperty"    -0.61
    "InitStruct"        -0.60
    "Manbalar"          -0.60
    "eqnarray"          -0.60
    Positive Logits
    " diversity"      2.88
    " Diversity"      2.54
    "diversity"       2.46
    "Diversity"       2.44
    " diverse"        2.19
    "diverse"         2.13
    "Diverse"         2.12
    " Diverse"        2.12
    " diversify"      1.98
    " diversidad"     1.88
    Act Density 0.098%

    No Known Activations