INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ivot
    -0.07
     브라
    -0.06
    Ông
    -0.06
     nghiệp
    -0.06
     olumlu
    -0.06
    Ab
    -0.06
     canh
    -0.06
     vzpom
    -0.06
     slaves
    -0.06
     Obamacare
    -0.05
    POSITIVE LOGITS
     calibration
    0.07
     modifies
    0.07
    0.07
    ===
    0.07
    Requests
    0.07
    prec
    0.06
    (proxy
    0.06
    \Facades
    0.06
    spin
    0.06
    'email
    0.06
    Act Density 0.004%

    No Known Activations