INDEX
Explanations
references to Africa or entities associated with it
New Auto-Interp
Negative Logits
agers
-0.18
emente
-0.16
ering
-0.16
Lesser
-0.15
essel
-0.15
cco
-0.15
ea
-0.15
uld
-0.14
neys
-0.14
e
-0.14
POSITIVE LOGITS
ghan
0.30
rique
0.24
rika
0.23
raid
0.21
onso
0.20
(AF
0.20
af
0.19
Afrika
0.18
yon
0.18
.af
0.18
Activations Density 0.014%