INDEX
Explanations
words related to discrimination and discussions around it
instances of the word "discrimination"
Negative Logits
Nieto         -0.77
Ire           -0.68
Jets          -0.65
Pryor         -0.63
profession    -0.63
heights       -0.62
AFP           -0.62
resilience    -0.62
native        -0.62
tall          -0.62
Positive Logits
disc          4.20
Disc          2.72
Disc          2.03
disc          1.49
disk          1.33
discs         1.25
deb           1.20
stud          1.11
isc           1.10
DIS           1.04
Activations Density 0.015%