INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     objetivo
    -0.07
    En
    -0.07
    kanı
    -0.07
     """
    -0.06
    classnames
    -0.06
    ционные
    -0.06
     rac
    -0.06
    раниц
    -0.06
    	ST
    -0.06
    €“
    -0.06
    POSITIVE LOGITS
     Opr
    0.07
    -max
    0.06
     waterproof
    0.06
    #
    0.06
     ساعت
    0.06
     navigator
    0.06
    restrict
    0.06
     Petr
    0.06
     corrected
    0.06
     pokrač
    0.06
    Act Density 0.000%

    No Known Activations