INDEX
    Explanations

    references to countries, specifically Morocco

    references to Morocco and Algeria

    Negative Logits
    Token         Logit
    "ellation"    -0.85
    "rophe"       -0.82
    "sworth"      -0.79
    "eve"         -0.79
    "ritch"       -0.75
    "ext"         -0.72
    "icle"        -0.72
    "ttle"        -0.70
    "riet"        -0.70
    "weeney"      -0.70
    Positive Logits
    Token          Logit
    " Morocco"     1.16
    " Algeria"     0.98
    " Alger"       0.93
    " Sahara"      0.91
    " Moroccan"    0.90
    " Tunisia"     0.84
    " Arabia"      0.83
    " Ré"          0.78
    "CLASSIFIED"   0.78
    " Arabian"     0.78
    Activation Density: 0.013%

    No Known Activations