INDEX
    Explanations

    socio-economic status and class disparities

    New Auto-Interp
    Negative Logits
    eree
    -0.17
     Jennings
    -0.15
     em
    -0.15
    eing
    -0.15
    ozem
    -0.14
     prod
    -0.14
    udic
    -0.13
    kaar
    -0.13
    ort
    -0.13
     Womens
    -0.13
    POSITIVE LOGITS
    arios
    0.17
    INV
    0.16
    .jav
    0.16
    èĢIJ
    0.15
    IRT
    0.15
     haut
    0.15
    cmc
    0.14
     Conc
    0.14
    ucch
    0.14
    abwe
    0.14
    Act Density 0.195%

    No Known Activations