INDEX
Explanations
terms related to Zionism and the Israeli-Palestinian conflict
New Auto-Interp
Negative Logits
etti
-0.16
ENDOR
-0.16
tainment
-0.16
гл
-0.15
thouse
-0.15
ifi
-0.14
ideos
-0.14
ÑĢаÑģп
-0.14
ymous
-0.14
好
-0.14
POSITIVE LOGITS
otas
0.16
713
0.16
rub
0.14
æĤ
0.13
çŁ
0.13
bate
0.13
medium
0.13
YTE
0.13
ráž
0.13
ather
0.13
Activations Density 0.050%