INDEX
Explanations
references to territorial governance and military forces in Iraq
New Auto-Interp
Negative Logits
azzi
-0.16
å®ħ
-0.15
ublic
-0.15
Wing
-0.15
çıŃ
-0.14
icher
-0.14
artin
-0.14
iu
-0.14
rait
-0.13
skirts
-0.13
POSITIVE LOGITS
ujet
0.17
ATAB
0.16
atum
0.15
æĨ
0.15
Bes
0.15
/window
0.15
isches
0.15
327
0.15
opot
0.14
imized
0.14
Activations Density 0.011%