INDEX
    Explanations

    phrases related to diverse individuals or groups

    phrases referencing people of color

    Negative Logits
    "Enlarge"               -0.67
    "ertodd"                -0.63
    "Features"              -0.62
    " externalToEVAOnly"    -0.60
    "gallery"               -0.59
    " Dispatch"             -0.58
    "abre"                  -0.58
    " PowerPoint"           -0.58
    "Examples"              -0.57
    " Donation"             -0.57
    Positive Logits
    "sembly"                0.89
    "ortunately"            0.85
    " whom"                 0.82
    " course"               0.80
    "ief"                   0.73
    "icial"                 0.67
    " theirs"               0.66
    "pires"                 0.64
    " justice"              0.64
    "course"                0.63
    Act Density 0.140%

    No Known Activations