Explanations
words from non-English languages
Negative Logits
Token            Value
مشين             -0.65
יצוני            -0.63
awtextra         -0.61
UnusedPrivate    -0.58
>=",             -0.56
Sucesor          -0.55
IgnoreCase       -0.54
propOrder        -0.52
חיצוני           -0.52
ֹת               -0.52
Positive Logits
Token            Value
Israeli          0.96
Israel           0.91
Israeli          0.87
Israel           0.85
Israël           0.80
isra             0.78
Aviv             0.77
Israelis         0.77
Israël           0.76
anyahu           0.76
Activations Density: 0.283%