INDEX
    Explanations

    concepts related to social theories on race

    New Auto-Interp
    Negative Logits
    thy
    -0.16
    udi
    -0.16
    ubo
    -0.16
    iculo
    -0.15
     bullet
    -0.15
    rtl
    -0.15
    udas
    -0.15
    uda
    -0.15
     Fam
    -0.15
    enheim
    -0.15
    POSITIVE LOGITS
    oki
    0.15
    901
    0.15
     Network
    0.15
    achts
    0.15
    iele
    0.14
    IBUT
    0.14
    ays
    0.14
     arrang
    0.14
    för
    0.14
    eyle
    0.14
    Act Density 0.363%

    No Known Activations