INDEX
    Explanations

    geographic references to countries, particularly in Africa

    New Auto-Interp
    Negative Logits
    ión
    -0.17
    صÙģ
    -0.16
    ocket
    -0.15
    kip
    -0.15
    igel
    -0.15
    hud
    -0.14
    FFE
    -0.14
    wc
    -0.14
    ần
    -0.14
    iones
    -0.14
    POSITIVE LOGITS
    atra
    0.19
    onde
    0.18
    ehler
    0.18
    ussy
    0.17
     lá»ĩ
    0.17
    uzu
    0.15
     лÑİ
    0.15
    outu
    0.15
     inline
    0.15
    ampp
    0.15
    Act Density 0.007%

    No Known Activations