INDEX
    Explanations

    phrases related to gaps and disparities

    references to various types of gaps, particularly socioeconomic or systemic disparities

    New Auto-Interp
    Negative Logits
    der
    -0.74
    vez
    -0.70
    abad
    -0.69
    ovy
    -0.64
    cause
    -0.64
    rich
    -0.63
    mberg
    -0.62
     Die
    -0.62
    da
    -0.60
    oran
    -0.60
    POSITIVE LOGITS
     between
    1.01
     wid
    1.00
     gap
    0.94
     widened
    0.90
    between
    0.85
     Between
    0.82
     gaps
    0.81
     widen
    0.80
     Gap
    0.79
     separating
    0.77
    Act Density 0.034%

    No Known Activations