INDEX
Explanations
references to historical segregation and civil rights issues
New Auto-Interp
Negative Logits
loating
-0.15
Secondary
-0.15
.cx
-0.15
ãĥªãĤ«
-0.14
disciplinary
-0.14
velopment
-0.14
thin
-0.13
udit
-0.13
kern
-0.13
fcc
-0.13
POSITIVE LOGITS
rá
0.18
aleb
0.15
aine
0.15
ovi
0.15
rez
0.14
rupt
0.14
imension
0.14
RAL
0.14
itizen
0.14
ÙĦا
0.13
Activations Density 0.086%