INDEX
    Explanations

    words from non-English languages

    New Auto-Interp
    Negative Logits
     مشين
    -0.65
    יצוני
    -0.63
    awtextra
    -0.61
    UnusedPrivate
    -0.58
     >=",
    -0.56
    Sucesor
    -0.55
    IgnoreCase
    -0.54
     propOrder
    -0.52
     חיצוני
    -0.52
    ֹת
    -0.52
    POSITIVE LOGITS
     Israeli
    0.96
     Israel
    0.91
    Israeli
    0.87
    Israel
    0.85
     Israël
    0.80
     isra
    0.78
     Aviv
    0.77
     Israelis
    0.77
    Israël
    0.76
    anyahu
    0.76
    Act Density 0.283%

    No Known Activations