INDEX
    Explanations

    references to social dynamics and disparities among groups

    New Auto-Interp
    Negative Logits
    .subplots
    -0.16
    ãĥ¼ãĥĢ
    -0.13
    obo
    -0.13
     åIJĦ
    -0.13
    ystack
    -0.13
    avin
    -0.13
    opus
    -0.13
    oras
    -0.12
    ymous
    -0.12
    BJECT
    -0.12
    POSITIVE LOGITS
     few
    0.75
    few
    0.67
     handful
    0.65
     Few
    0.64
    Few
    0.60
     fewer
    0.52
     select
    0.51
     quelques
    0.47
    å°ij
    0.45
    select
    0.36
    Act Density 0.274%

    No Known Activations