INDEX
Explanations
acknowledgments of funding and gratitude in research papers
Israel and Jerusalem
Negative Logits
Louisiana    -0.55
kasarigan    -0.54
برانيه    -0.53
יצוני    -0.53
'{@    -0.52
neous    -0.51
***!    -0.50
grine    -0.50
deoxy    -0.49
ByExample    -0.49
POSITIVE LOGITS
Israel    1.11
Israeli    1.10
Israel    0.98
IDF    0.98
Israelis    0.94
Hebrew    0.93
Israeli    0.93
Netanyahu    0.93
Jerusalem    0.91
ISRAEL    0.85
Activations Density 0.080%