Explanations
phrases related to gaps and disparities
references to various types of gaps, particularly socioeconomic or systemic disparities
Negative Logits
der     -0.74
vez     -0.70
abad    -0.69
ovy     -0.64
cause   -0.64
rich    -0.63
mberg   -0.62
Die     -0.62
da      -0.60
oran    -0.60
Positive Logits
between     1.01
wid         1.00
gap         0.94
widened     0.90
between     0.85
Between     0.82
gaps        0.81
widen       0.80
Gap         0.79
separating  0.77
Activations Density: 0.034%