INDEX
    Explanations

    mentions of societal issues related to discrimination and human rights

    New Auto-Interp
    Negative Logits
    0000000000000000
    -0.66
    lot
    -0.66
    NOW
    -0.66
     lodged
    -0.66
    HAEL
    -0.65
     %%
    -0.65
    soever
    -0.64
     debuted
    -0.63
    LOG
    -0.63
    ALSE
    -0.62
    POSITIVE LOGITS
     lieu
    1.48
    effic
    1.45
     accordance
    1.39
    efficiency
    1.38
     spite
    1.38
     relation
    1.37
     conjunction
    1.31
    roads
    1.28
    clusions
    1.27
    ordinate
    1.25
    Act Density 1.185%

    No Known Activations