INDEX
    Explanations

    references to geographical or cultural entities

    New Auto-Interp
    Negative Logits
    ero
    -0.17
    ipi
    -0.16
    hani
    -0.16
    hoot
    -0.15
    adh
    -0.15
    isse
    -0.15
    rias
    -0.15
    roud
    -0.14
    RITE
    -0.14
    orts
    -0.14
    POSITIVE LOGITS
    lÃŃn
    0.19
    ür
    0.18
    aire
    0.17
    entral
    0.17
    vez
    0.17
    wick
    0.17
    celand
    0.16
    weis
    0.16
    enuity
    0.16
    aan
    0.16
    Act Density 0.020%

    No Known Activations