INDEX
NEGATIVE LOGITS
हेल्दी           0.44
khiển            0.44
Ethernet         0.41
Kaspersky        0.41
Macron           0.40
Characteristics  0.40
无关             0.40
Tencent          0.40
Attributes       0.40
耶               0.39
POSITIVE LOGITS
segregation      2.09
segregated       1.80
segreg           1.67
Seg              1.42
seg              1.41
seg              1.40
Jim              1.38
Seg              1.35
Jim              1.33
racially         1.30
Activations Density 0.020%