INDEX
    Explanations

    references to political tensions and conflicts, particularly involving Israel and Palestine

    New Auto-Interp
    Negative Logits
    imax
    -0.17
    occer
    -0.16
    amax
    -0.15
    Äħd
    -0.15
    unkt
    -0.14
    loi
    -0.14
     Wax
    -0.14
     ÐĴоз
    -0.14
    YTE
    -0.14
    ifica
    -0.14
    POSITIVE LOGITS
    eters
    0.14
     centr
    0.14
     cent
    0.14
    asse
    0.13
    ectar
    0.13
    æª
    0.13
    rada
    0.13
    ij
    0.13
    .asm
    0.13
    artner
    0.13
    Act Density 0.198%

    No Known Activations