    Explanations

    phrases related to disparities and differences

    references to disparities or differences, particularly those labeled as "gaps."

    Negative Logits
    Token            Logit
    "der"            -0.68
    "rich"           -0.68
    "di"             -0.65
    "ivery"          -0.64
    "vez"            -0.63
    "ise"            -0.60
    " vic"           -0.59
    "gg"             -0.59
    "ocations"       -0.58
    "asting"         -0.57

    Positive Logits
    Token            Logit
    " between"        1.10
    "between"         1.03
    " wid"            1.03
    " gap"            0.95
    " widened"        0.94
    " widen"          0.88
    " Between"        0.88
    " separating"     0.86
    " widening"       0.80
    " Gap"            0.79

    Activation Density: 0.046%

    No Known Activations