INDEX
    Explanations

    terms related to social and economic issues, particularly those involving inequality and systemic barriers

    New Auto-Interp
    Negative Logits
     addCriterion
    -0.18
    è¿Ļä¸Ģ
    -0.14
    vetica
    -0.14
    buat
    -0.14
    uchos
    -0.14
    immel
    -0.14
    éĤ£ç§į
    -0.14
     Barrett
    -0.14
    _THIS
    -0.13
    }elseif
    -0.13
    POSITIVE LOGITS
    esson
    0.17
    elere
    0.15
    prav
    0.14
    shall
    0.14
    uer
    0.14
    ansson
    0.14
    aber
    0.14
    ìłĢ
    0.14
     ÎĶη
    0.14
    illos
    0.14
    Act Density 0.218%

    No Known Activations