INDEX
    Explanations

    references to specific cities, especially their names

    New Auto-Interp
    Negative Logits
    dit
    -0.16
    nox
    -0.15
    ders
    -0.15
    δÏģα
    -0.15
    ylene
    -0.14
    izzard
    -0.14
    riott
    -0.14
    ding
    -0.14
    edy
    -0.14
    idon
    -0.14
    POSITIVE LOGITS
    esion
    0.15
     Hib
    0.14
    agh
    0.14
    ruh
    0.14
    ActionBar
    0.14
    å
    0.14
    eref
    0.14
    iona
    0.14
    asin
    0.13
     McL
    0.13
    Act Density 0.011%

    No Known Activations