INDEX
Explanations
phrases related to diverse individuals or groups
phrases referencing people of color
New Auto-Interp
Negative Logits
Enlarge
-0.67
ertodd
-0.63
Features
-0.62
externalToEVAOnly
-0.60
gallery
-0.59
Dispatch
-0.58
abre
-0.58
PowerPoint
-0.58
Examples
-0.57
Donation
-0.57
POSITIVE LOGITS
sembly
0.89
ortunately
0.85
whom
0.82
course
0.80
ief
0.73
icial
0.67
theirs
0.66
pires
0.64
justice
0.64
course
0.63
Activations Density 0.140%