INDEX
    Explanations

    references to national entities or national-level issues

    New Auto-Interp
    Negative Logits
    ØŃداث
    -0.17
    anuts
    -0.17
    ä¸Ī
    -0.16
    431
    -0.16
    inz
    -0.15
    TINGS
    -0.15
    angen
    -0.15
    kova
    -0.14
    важ
    -0.14
    ëij
    -0.14
    POSITIVE LOGITS
    /local
    0.19
    eres
    0.19
    /reg
    0.17
    ych
    0.16
     Tos
    0.15
    elow
    0.15
    okes
    0.15
    opes
    0.14
    ities
    0.14
    andum
    0.14
    Act Density 0.028%

    No Known Activations