INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     entreg
    -0.08
     Gesch
    -0.08
    h
    -0.07
     Germ
    -0.07
    лага
    -0.07
     tenho
    -0.07
    -0.07
     vent
    -0.07
     germ
    -0.07
    درجة
    -0.07
    POSITIVE LOGITS
    fik
    0.08
     pf
    0.07
     workplace
    0.07
    0.07
     Kemp
    0.07
     администра
    0.07
    AMI
    0.07
     overhaul
    0.07
    -angle
    0.07
    0.07
    Act Density 0.010%

    No Known Activations