INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    άνει
    -0.07
     difficulty
    -0.07
    وفي
    -0.06
    enefit
    -0.06
    -view
    -0.06
    ลงท
    -0.06
    	df
    -0.06
     امکان
    -0.06
     exclusively
    -0.06
    _end
    -0.06
    POSITIVE LOGITS
    Snap
    0.07
    евого
    0.07
     fotos
    0.06
     Snap
    0.06
     exig
    0.06
     india
    0.06
     advertisement
    0.06
    ่ต
    0.06
     quiere
    0.06
     pratique
    0.06
    Act Density 0.032%

    No Known Activations