INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    	pre
    -0.08
     lights
    -0.08
    ియర్
    -0.07
     Cen
    -0.07
    -Er
    -0.07
    ந்த
    -0.07
    ంభ
    -0.07
     initializing
    -0.07
     smokers
    -0.07
    -0.07
    POSITIVE LOGITS
     اعتماد
    0.09
     रोजगार
    0.09
     Held
    0.08
    employment
    0.08
     trusty
    0.08
     pross
    0.08
    jenige
    0.08
     actually
    0.08
     nida
    0.07
    Monto
    0.07
    Act Density 0.005%

    No Known Activations