INDEX
    Explanations

    references to historical and social injustices

    New Auto-Interp
    Negative Logits
    .Override
    -0.16
    emand
    -0.16
    iev
    -0.15
    podob
    -0.15
    Emitter
    -0.15
    ifo
    -0.14
    ones
    -0.14
    hazi
    -0.14
    essel
    -0.14
    cÃŃm
    -0.14
    POSITIVE LOGITS
     present
    0.41
     alive
    0.32
     presente
    0.28
    alive
    0.28
     Present
    0.27
     everywhere
    0.26
    present
    0.25
    -present
    0.24
    Present
    0.24
     active
    0.23
    Act Density 0.232%

    No Known Activations