INDEX
Explanations
references to government officials and ministries
Negative Logits
Token    Logit
bor      -0.15
ứa       -0.15
ÏĢει     -0.14
aran     -0.14
ạn       -0.14
endor    -0.14
osta     -0.13
336      -0.13
unit     -0.13
acity    -0.13
Positive Logits
Token    Logit
ial      0.25
IAL      0.18
wide     0.18
aires    0.15
lake     0.15
:min     0.15
erva     0.14
ochond   0.14
ials     0.14
но       0.14
Activations Density: 0.023%