INDEX
Explanations
references to historical events and systemic issues affecting Black Americans
Negative Logits
Token      Logit
gression   -0.17
apult      -0.15
ideos      -0.15
rv         -0.15
Bash       -0.14
UnitTest   -0.14
-env       -0.14
evac       -0.14
oples      -0.13
awa        -0.13
Positive Logits
Token      Logit
lyn        0.45
Ku         0.37
Lyn        0.36
ynch       0.36
lyn        0.36
Lynch      0.34
Jim        0.30
klu        0.29
Klan       0.29
Kl         0.29
Activations Density 0.066%