INDEX
Explanations
references to stereotypes and misrepresentations of Native Americans and other marginalized groups
Negative Logits
transférez        -0.59
estekak           -0.54
tagHelperRunner   -0.47
sauter            -0.42
velocity          -0.41
Absorption        -0.39
placebo           -0.39
Autorizaciones    -0.38
다시              -0.38
保安              -0.38
Positive Logits
portrayed         0.74
portrayal         0.70
portrays          0.69
portray           0.69
depict            0.63
depicted          0.60
depiction         0.60
depicts           0.60
portraying        0.59
ritratto          0.54
Activations Density: 0.363%