INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    -dollar
    -0.07
     Continuous
    -0.07
    Fu
    -0.07
    rtype
    -0.06
    .Prop
    -0.06
     af
    -0.06
     exercitation
    -0.06
     voksen
    -0.06
    shown
    -0.06
    .N
    -0.06
    POSITIVE LOGITS
    0.06
     Edinburgh
    0.06
     işe
    0.06
    (Msg
    0.06
     منظ
    0.06
     Giáo
    0.06
    	register
    0.06
    /task
    0.06
    ervisor
    0.06
    0.06
    Act Density 0.000%

    No Known Activations