INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    =require
    -0.06
     PDO
    -0.06
    alamat
    -0.06
     Hz
    -0.06
    Actualizar
    -0.06
    FSIZE
    -0.06
    -Nazi
    -0.06
     vyz
    -0.06
     xhr
    -0.06
     TEX
    -0.06
    POSITIVE LOGITS
     use
    0.07
    career
    0.07
     friendly
    0.07
    friendly
    0.07
     Use
    0.07
    utter
    0.07
     affiliates
    0.06
    Actions
    0.06
    ик
    0.06
    vice
    0.06
    Act Density 0.006%

    No Known Activations