INDEX
    Explanations

    references to demographic representation and diversity in a social context

    New Auto-Interp
    Negative Logits
    /Instruction
    -0.17
     å²
    -0.15
     Hosp
    -0.14
    ëıħ
    -0.14
    Monitor
    -0.14
     ebay
    -0.14
    rame
    -0.13
    ulan
    -0.13
    zan
    -0.13
    ovie
    -0.13
    POSITIVE LOGITS
     Science
    0.34
     scientists
    0.33
     STEM
    0.33
     science
    0.33
     scientist
    0.31
    Science
    0.30
     Scientist
    0.30
     Scientists
    0.30
     scientific
    0.29
     Scientific
    0.28
    Act Density 0.011%

    No Known Activations