INDEX
    Explanations

    references to the United Nations and its various programs and initiatives

    Negative Logits
    ery
    -0.15
    guard
    -0.14
    erton
    -0.14
    AKE
    -0.14
     Rog
    -0.13
    899
    -0.13
    NST
    -0.13
    board
    -0.13
     å
    -0.12
    俺は
    -0.12
    Positive Logits
    DP
    0.18
     UN
    0.18
    iversal
    0.16
    ited
    0.16
    avour
    0.16
    (Un
    0.16
    /world
    0.15
    ाइट
    0.15
    ایت
    0.14
     Charter
    0.14
    Activation Density 0.013%

    No Known Activations