INDEX
    Explanations

    discussions surrounding social justice and systemic injustices

    injustice, violence, and suffering

    New Auto-Interp
    Negative Logits
    -0.56
     gyhoeddwyd
    -0.56
     CreateTagHelper
    -0.52
     kasarigan
    -0.50
    kheim
    -0.50
    -0.50
    ayage
    -0.49
    lilla
    -0.49
     Warga
    -0.48
    Eti
    -0.48
    POSITIVE LOGITS
    /**
    0.42
    AndroidJUnit
    0.35
    apimachinery
    0.33
    bitField
    0.32
    kében
    0.31
     crimes
    0.31
     utafitiHapana
    0.31
    toxicity
    0.31
    évaluateur
    0.30
     violence
    0.30
    Act Density 0.191%

    No Known Activations