INDEX
    Explanations

    Israel-Palestine conflict

    New Auto-Interp
    Negative Logits
     itſelf
    -0.84
     dezelve
    -0.75
     zoude
    -0.74
    reszcie
    -0.71
     picioare
    -0.71
     möge
    -0.69
     zijne
    -0.69
    geführt
    -0.68
     zelve
    -0.68
     يتيمه
    -0.68
    POSITIVE LOGITS
    ServiceModel
    0.52
    secre
    0.51
     dat
    0.50
     ho
    0.49
     che
    0.46
    toContain
    0.46
     ran
    0.45
     String
    0.44
     adel
    0.44
     cor
    0.44
    Act Density 0.022%

    No Known Activations