    Explanations

    references to racial identity and issues concerning African Americans

    Negative Logits
    Portály           -0.72
     ModelExpression  -0.68
     hate             -0.64
     bias             -0.61
     censiti          -0.60
    Dislikes          -0.57
    DotNetBar         -0.56
    IsMutable         -0.56
    lencia            -0.55
     intolerance      -0.53
    Positive Logits
     ReactDOM         0.69
    ();)              0.63
     typelib          0.61
    })$}              0.60
    eapples           0.60
     становника       0.59
    เร็               0.57
     fallait          0.56
     kasarigan        0.56
    DECREF            0.55
    Act Density 0.025%

    No Known Activations