INDEX
    Explanations

    phrases related to discrimination based on various characteristics like sexual orientation, race, ethnicity, and gender

    phrases related to discrimination based on various characteristics

    New Auto-Interp
    Negative Logits
    icer
    -0.70
    cells
    -0.70
    mobi
    -0.65
     Sunder
    -0.63
     upd
    -0.63
     Purg
    -0.63
     Booster
    -0.63
     Loop
    -0.62
    knit
    -0.60
     Meteor
    -0.60
    POSITIVE LOGITS
     ethnicity
    1.13
     nationality
    1.09
     gender
    0.99
     racial
    0.90
     religion
    0.88
     creed
    0.85
    reditary
    0.85
     race
    0.85
     colour
    0.83
     color
    0.83
    Act Density 0.310%

    No Known Activations