    Explanations

    names of people and places, particularly those with unique diacritical marks or accents

    Negative Logits
    Token    Logit
    lar      -0.30
    았       -0.26
    았다     -0.26
    ca       -0.24
    ban      -0.23
    ça       -0.22
    va       -0.21
    dır      -0.20
    ra       -0.20
    ta       -0.20
    Positive Logits
    Token    Logit
    zet      0.25
    inde     0.19
    де       0.19
    ény      0.19
    ő        0.19
    ye       0.19
    iben     0.19
    ре       0.18
    etty     0.18
    ben      0.18
    Activation Density: 0.008%

    No Known Activations