INDEX
    Explanations

    racial or group identifiers used in a negative or segregating way.

    Negative Logits
     tartalomajánló     -0.65
    Dữ                  -0.65
    Sucesor             -0.64
     gynhyrchwyd        -0.64
    <bos>               -0.63
    ########.           -0.62
    __":\n              -0.59
    adpleegd            -0.58
    }]\n                -0.57
     كومونز             -0.57
    Positive Logits
    StoryboardSegue     0.56
     kimse              0.52
     autrefois          0.47
    memoized            0.47
    MLLoader            0.47
    UnitTesting         0.46
    .                   0.44
    ArgumentParser      0.44
    SuccessListener     0.43
    vertx               0.43
    Activation Density: 2.831%

    No Known Activations