INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     specially
    -0.08
    stdafx
    -0.08
     دادن
    -0.08
     قرض
    -0.08
    ませ
    -0.07
    	JPanel
    -0.07
     plantations
    -0.07
     pools
    -0.07
     مستقیم
    -0.07
     destinado
    -0.07
    POSITIVE LOGITS
     violations
    0.08
     brief
    0.08
     fram
    0.08
     Viol
    0.08
     publik
    0.07
    Viol
    0.07
     higher
    0.07
     ensure
    0.07
     compet
    0.07
     pelos
    0.07
    Act Density 0.000%

    No Known Activations