INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    قر
    -0.09
     sweating
    -0.09
    خش
    -0.08
     Diagnosis
    -0.08
     مورد
    -0.08
     Paris
    -0.08
    سس
    -0.08
    لالة
    -0.08
    سيل
    -0.08
    ্�
    -0.07
    POSITIVE LOGITS
     Lithuan
    0.09
    0.08
    mt
    0.08
     potr
    0.08
    metrical
    0.08
    immt
    0.08
    	elem
    0.08
    Dv
    0.08
    imu
    0.08
    imaal
    0.08
    Act Density 0.010%

    No Known Activations