INDEX
Explanations
phrases related to legal or policy aspects of business practices
occurrences of refusal or denial related to service or rights based on certain criteria
New Auto-Interp
Negative Logits
âĹ¼
-0.76
terness
-0.72
ãĤ´ãĥ³
-0.71
cffffcc
-0.71
Mom
-0.70
itsch
-0.69
luckily
-0.69
isSpecialOrderable
-0.68
Ire
-0.68
çͰ
-0.66
POSITIVE LOGITS
certain
0.93
discriminatory
0.79
licenses
0.75
lawfully
0.74
their
0.73
enance
0.72
discretionary
0.70
objectionable
0.69
harmful
0.69
arbitrary
0.68
Activations Density 0.417%