INDEX
    Explanations
    NEGATIVE LOGITS
    Token             Logit
    (whitespace)      -0.07
    "้ท"               -0.07
    "idata"           -0.07
    "kbd"             -0.07
    (whitespace)      -0.06
    "arsimp"          -0.06
    "ським"           -0.06
    " pentru"         -0.06
    ".Bean"           -0.06
    "_locals"         -0.06

    POSITIVE LOGITS
    Token             Logit
    "ona"             0.07
    " όλα"            0.06
    " Prix"           0.06
    " Liga"           0.06
    " Whites"         0.06
    " Gree"           0.06
    (no token text)   0.06
    "alia"            0.06
    " advisory"       0.06
    " محصول"          0.06

    Act Density 0.001%

    No Known Activations