INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Africa
    -1.98
    Africa
    -1.77
     AFRICA
    -1.66
     africa
    -1.58
     África
    -1.48
    africa
    -1.35
     Afrika
    -1.20
     Афри
    -1.06
    GEBURTSDATUM
    -1.00
     africano
    -0.98
    POSITIVE LOGITS
    o
    0.64
    s
    0.61
    '
    0.60
     and
    0.59
    0.57
    a
    0.56
    e
    0.54
    i
    0.54
    an
    0.53
    يان
    0.52
    Act Density 0.157%

    No Known Activations