INDEX
    Explanations

    words or segments related to specific geographic locations or names

    New Auto-Interp
    Negative Logits
     ru
    -0.16
    iterr
    -0.15
    esub
    -0.14
    à¸Ļะ
    -0.14
     defaults
    -0.14
    sy
    -0.14
    inery
    -0.14
     champagne
    -0.14
    irsch
    -0.13
    hma
    -0.13
    POSITIVE LOGITS
    pson
    0.19
    erville
    0.16
    sville
    0.15
    anlı
    0.15
    tur
    0.14
    ванов
    0.14
    ãĥĪãĥ«
    0.14
    wan
    0.14
    anghai
    0.14
    Ļæ±Ł
    0.14
    Act Density 0.058%

    No Known Activations