INDEX
    Explanations

    references to political figures and their roles within international organizations

    New Auto-Interp
    Negative Logits
     nationwide
    -0.14
    pek
    -0.14
     Nationwide
    -0.14
    à¥įवत
    -0.14
    .sponge
    -0.14
    ortic
    -0.14
    aÄį
    -0.14
    à¥ĩà¤
    -0.14
    æ¦
    -0.14
    HK
    -0.14
    POSITIVE LOGITS
     UN
    0.77
    UN
    0.68
     United
    0.61
    United
    0.55
    .UN
    0.48
     UNITED
    0.48
    _UN
    0.47
     UNS
    0.44
     UNESCO
    0.42
    UNIT
    0.41
    Act Density 0.342%

    No Known Activations