INDEX
Explanations
countries, governments, respective
Negative Logits
האט    0.38
ष्णा    0.38
ząc    0.38
Geschä    0.38
सीआरपीएफ    0.37
انگلیسی    0.36
unger    0.36
শাহ    0.35
荷兰    0.35
outen    0.34
Positive Logits
各国    1.00
governments    0.94
countries    0.89
देशों    0.88
країн    0.88
respective    0.87
local    0.85
countries    0.84
सरकारों    0.84
Countries    0.84
Activations Density 0.049%