INDEX
Explanations
geographical references to countries in Africa
New Auto-Interp
Negative Logits
ÙĨدÙĤ
-0.16
autos
-0.15
ialect
-0.15
tro
-0.15
baugh
-0.15
torino
-0.15
venez
-0.14
_Detail
-0.14
جÙĩ
-0.14
TemplateName
-0.14
POSITIVE LOGITS
Mal
0.31
Mal
0.24
Lil
0.24
mal
0.22
Chip
0.22
MCP
0.20
abwe
0.20
MAL
0.20
Livingston
0.19
Chip
0.19
Activations Density 0.003%