INDEX
Explanations
references to specific ethnic groups
terms related to ethnic identities and groups
New Auto-Interp
Negative Logits
uden
-0.91
aunder
-0.87
tower
-0.84
aday
-0.79
agher
-0.78
=-=-=-=-
-0.76
uckland
-0.75
etheus
-0.75
OHN
-0.73
AUT
-0.73
POSITIVE LOGITS
cleansing
1.15
minorities
1.03
ities
0.92
minority
0.91
backgrounds
0.81
identity
0.80
appropriation
0.79
ancestry
0.79
supremacists
0.79
ethnic
0.79
Activations Density 0.021%