INDEX
Explanations
names related to Middle Eastern individuals or groups, particularly those associated with conflicts or politics
terms related to military groups and individuals involved in conflicts
Negative Logits
Wilde       -0.84
Chloe       -0.82
Titanic     -0.80
Robo        -0.76
Poles       -0.75
Poland      -0.73
Wilhelm     -0.73
Nadu        -0.73
Polish      -0.70
Columbus    -0.69
Positive Logits
awi         1.35
zbollah     1.28
qqa         1.27
abi         1.23
aq          1.16
adi         1.12
azeera      1.09
ayn         1.07
Iraq        1.05
raq         1.05
Activations Density: 0.140%