INDEX
Explanations
phrases related to disparities and differences
references to disparities or differences, particularly those labeled as "gaps."
New Auto-Interp
Negative Logits
der
-0.68
rich
-0.68
di
-0.65
ivery
-0.64
vez
-0.63
ise
-0.60
vic
-0.59
gg
-0.59
ocations
-0.58
asting
-0.57
POSITIVE LOGITS
between
1.10
between
1.03
wid
1.03
gap
0.95
widened
0.94
widen
0.88
Between
0.88
separating
0.86
widening
0.80
Gap
0.79
Activations Density 0.046%