INDEX
    Explanations

    words related to discrimination and discussions around it

    instances of the word "discrimination"

    Negative Logits
    " Nieto"         -0.77
    " Ire"           -0.68
    " Jets"          -0.65
    " Pryor"         -0.63
    " profession"    -0.63
    " heights"       -0.62
    " AFP"           -0.62
    " resilience"    -0.62
    " native"        -0.62
    " tall"          -0.62

    Positive Logits
    "disc"            4.20
    "Disc"            2.72
    " Disc"           2.03
    " disc"           1.49
    "disk"            1.33
    " discs"          1.25
    "deb"             1.20
    "stud"            1.11
    "isc"             1.10
    "DIS"             1.04

    Activation Density: 0.015%

    No Known Activations