INDEX
Explanations
terms related to governance and socio-economic structures
New Auto-Interp
Negative Logits
odos
-0.16
377
-0.15
çij
-0.15
utin
-0.14
unma
-0.14
inya
-0.14
aklı
-0.14
leÅŁik
-0.13
pyx
-0.13
лиÑı
-0.13
POSITIVE LOGITS
nÃło
0.19
whose
0.18
à¹Ģà¸ķà¸Ńร
0.15
sez
0.15
ivent
0.14
اباÙĨ
0.14
ilst
0.14
adian
0.14
HLT
0.14
inha
0.14
Activations Density 0.221%