INDEX
Explanations
terms related to exclusion and exclusionary practices
New Auto-Interp
Negative Logits
styleType
-0.52
DockStyle
-0.40
WriteAttribute
-0.39
Eg
-0.38
Ин
-0.35
jgl
-0.34
reas
-0.34
市
-0.33
GIH
-0.33
extAlignment
-0.33
POSITIVE LOGITS
Expanded
0.77
excludes
0.73
whoever
0.70
Hvem
0.70
exclude
0.70
excluded
0.69
Who
0.68
expansion
0.68
Quien
0.68
Siapa
0.67
Activations Density 0.236%