INDEX
    Explanations

    countries, governments, respective

    New Auto-Interp
    Negative Logits
     האט
    0.38
    ष्णा
    0.38
    ząc
    0.38
     Geschä
    0.38
     सीआरपीएफ
    0.37
     انگلیسی
    0.36
    unger
    0.36
     শাহ
    0.35
    荷兰
    0.35
    outen
    0.34
    POSITIVE LOGITS
    各国
    1.00
     governments
    0.94
     countries
    0.89
     देशों
    0.88
     країн
    0.88
     respective
    0.87
     local
    0.85
    countries
    0.84
     सरकारों
    0.84
    Countries
    0.84
    Act Density 0.049%

    No Known Activations