INDEX
Explanations
mentions of Israel and its related entities
New Auto-Interp
Negative Logits
Kopp
-0.80
помним
-0.73
Boyz
-0.73
Jacoby
-0.72
skyl
-0.69
Oscill
-0.68
Mori
-0.68
Bryant
-0.67
fabrics
-0.67
Liar
-0.66
POSITIVE LOGITS
Israel
1.24
Israel
1.14
Israeli
1.04
ISRAEL
1.01
israel
1.00
RAEL
0.96
Israeli
0.95
Israël
0.93
Israelis
0.87
WriteBarrier
0.78
Activations Density 0.121%