Explanations
references to political tensions and conflicts, particularly involving Israel and Palestine
Negative Logits
imax    -0.17
occer   -0.16
amax    -0.15
Äħd     -0.15
unkt    -0.14
loi     -0.14
Wax     -0.14
ÐĴоз    -0.14
YTE     -0.14
ifica   -0.14
Positive Logits
eters   0.14
centr   0.14
cent    0.14
asse    0.13
ectar   0.13
æª      0.13
rada    0.13
ij      0.13
.asm    0.13
artner  0.13
Activations Density: 0.198%