INDEX
Explanations
references to racism and racial discrimination
New Auto-Interp
Negative Logits
__":
-0.74
ThroughAttribute
-0.70
IBarButtonItem
-0.69
VersionUID
-0.68
utafitiHapana
-0.67
TagMode
-0.67
AssemblyVersion
-0.67
Tikang
-0.64
KommentareTeilen
-0.64
%">
-0.64
POSITIVE LOGITS
racial
1.18
racially
1.02
Racial
1.00
racism
0.99
racist
0.94
Racism
0.92
racial
0.91
race
0.86
Racism
0.83
discrimination
0.80
Activations Density 0.451%