INDEX
    Explanations

    phrases indicating systemic issues and challenges related to social and economic disparities

    New Auto-Interp
    Negative Logits
    ắt
    -0.15
    ews
    -0.14
     Ih
    -0.14
    subtype
    -0.14
    wart
    -0.14
    itol
    -0.14
     глаза
    -0.14
    imson
    -0.13
    levision
    -0.13
    uchi
    -0.13
    POSITIVE LOGITS
     shift
    0.19
    Shift
    0.16
     Duty
    0.15
     trend
    0.15
    mour
    0.15
    aminer
    0.14
     active
    0.13
     increase
    0.13
     push
    0.13
     drop
    0.13
    Act Density 0.156%

    No Known Activations