Explanations
references to political and social controversies related to Israel and Palestine
Negative Logits
Token    Logit
uger     -0.16
Grid     -0.15
iller    -0.14
LSU      -0.14
ocial    -0.14
vale     -0.14
obe      -0.14
empor    -0.14
Dao      -0.13
grid     -0.13
Positive Logits
Token          Logit
Palestine      0.36
Palestinian    0.32
Pale           0.32
Israel         0.32
BDS            0.31
Palest         0.31
Palestinians   0.29
Jew            0.28
Palestin       0.28
Israeli        0.27
Activations Density 0.085%
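
As a rough illustration of what the density figure could mean, the sketch below assumes it is the fraction of tokens on which the feature has a non-zero activation. That definition, the function name `activation_density`, and the sample data are all assumptions made for illustration; none of them come from the page itself.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens where the feature fires.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 20,000 tokens, with the feature firing on 17 of them.
sample = [0.0] * 20000
for i in range(17):
    sample[i * 1000] = 2.0

density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.085%
```

Under this reading, a density of 0.085% would mean the feature is active on fewer than one token in a thousand.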