INDEX
    Explanations

    references to bias in algorithms and their societal implications

    New Auto-Interp
    Negative Logits
    AnchorTagHelper
    -0.52
    cases
    -0.38
     Wege
    -0.37
    Cyfarwyddwr
    -0.37
     AppCompatTheme
    -0.37
    ̍
    -0.36
    -0.35
     stadig
    -0.35
    ways
    -0.34
     Vereinbarung
    -0.33
    POSITIVE LOGITS
    setVerticalGroup
    0.56
     quality
    0.56
     kwaliteit
    0.54
     Accuracy
    0.51
    msgTypes
    0.49
     flawed
    0.48
    quality
    0.48
     Quality
    0.48
     inaccuracies
    0.47
     QUALITY
    0.47
    Act Density 0.514%

    No Known Activations