INDEX
    Explanations

    references to discrimination and inequity in various contexts

    mentions of specific names, symbols, or technical elements, often related to programming, calculations, or data analysis.

    New Auto-Interp
    Negative Logits
     sécur
    -0.48
     istrinya
    -0.47
    ießen
    -0.45
     Everyone
    -0.44
     grito
    -0.44
     acompañ
    -0.43
    adpleegd
    -0.43
     rağmen
    -0.43
     kaca
    -0.43
     ajuns
    -0.42
    POSITIVE LOGITS
     Roskov
    0.81
    Autoritní
    0.72
     EconPapers
    0.69
     tuong
    0.60
    ArgumentParser
    0.59
    SequentialGroup
    0.58
     clearColor
    0.57
    ^(@)
    0.56
     kasarigan
    0.55
    occan
    0.54
    Act Density 1.590%

    No Known Activations