INDEX
Explanations
references to Africa and related themes like slavery and the African diaspora
New Auto-Interp
Negative Logits
upat
-0.66
aup
-0.63
sweise
-0.61
Domenico
-0.60
lot
-0.59
ゆ
-0.58
ting
-0.57
mi
-0.55
ıyla
-0.55
olge
-0.55
POSITIVE LOGITS
Africans
1.61
Africa
1.59
Africa
1.47
África
1.45
African
1.44
AFRICA
1.43
afric
1.43
africa
1.40
africano
1.38
africa
1.37
Activations Density 0.200%