INDEX
Explanations
references to cultural institutions and authorities in Russia and the Netherlands
New Auto-Interp
Negative Logits
untreated
-0.15
-0.15
berry
-0.14
untranslated
-0.14
offline
-0.13
Africa
-0.13
africa
-0.13
atan
-0.13
ìļ°ìĬ¤
-0.13
elen
-0.13
POSITIVE LOGITS
Republic
0.35
Republic
0.34
republic
0.25
ÐłÐµÑģпÑĥбли
0.21
åħ±åĴĮåĽ½
0.20
Cumhuriyeti
0.20
جÙħÙĩÙĪØ±
0.19
Kingdom
0.19
republiky
0.19
REP
0.19
Activations Density 0.053%