Explanations
references to the United Nations and its related activities
Negative Logits
federal   -0.15
xuyên     -0.15
ンピ      -0.15
oop       -0.14
Bundes    -0.14
zier      -0.14
oine      -0.14
stdint    -0.13
baugh     -0.13
ulia      -0.13
Positive Logits
UN        0.19
UN        0.18
WithMany  0.16
rts       0.16
Viol      0.16
ツ        0.15
dashes    0.15
dash      0.15
issan     0.15
Earth     0.14
Activations Density: 0.028%