INDEX
Explanations
terms related to discrimination and inequality
New Auto-Interp
Negative Logits
ends
-0.16
lify
-0.15
emo
-0.15
تÙĪÙĨ
-0.14
/current
-0.14
ÙħÛĮÙĦادÛĮ
-0.14
spring
-0.14
asty
-0.14
auto
-0.14
oud
-0.13
POSITIVE LOGITS
.scalablytyped
0.16
ulti
0.16
bersome
0.15
ellen
0.15
BootApplication
0.14
.ISupportInitialize
0.14
iveness
0.14
ulerAngles
0.14
غاÙĦ
0.14
kening
0.14
Activations Density 0.023%