INDEX
Explanations
phrases related to international relations and cooperation
New Auto-Interp
Negative Logits
atab
-0.17
here
-0.17
borg
-0.17
몰
-0.15
bourg
-0.15
jong
-0.15
_Internal
-0.14
ÑģÑİ
-0.14
pai
-0.14
unify
-0.14
POSITIVE LOGITS
bilateral
0.48
Bil
0.40
bil
0.36
bil
0.29
mutual
0.29
ilateral
0.27
trade
0.25
mutually
0.24
relations
0.24
friendship
0.24
Activations Density 0.236%