Explanations
references to countries, specifically Morocco
references to Morocco and Algeria
Negative Logits
Token      Logit
ellation   -0.85
rophe      -0.82
sworth     -0.79
eve        -0.79
ritch      -0.75
ext        -0.72
icle       -0.72
ttle       -0.70
riet       -0.70
weeney     -0.70
Positive Logits
Token       Logit
Morocco     1.16
Algeria     0.98
Alger       0.93
Sahara      0.91
Moroccan    0.90
Tunisia     0.84
Arabia      0.83
Ré          0.78
CLASSIFIED  0.78
Arabian     0.78
Activations Density: 0.013%