INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    Carol
    -0.06
    GX
    -0.06
    Fi
    -0.06
     دهد
    -0.06
     пред
    -0.06
    正常
    -0.06
     GridBagConstraints
    -0.06
    Lifetime
    -0.06
     zwarte
    -0.06
     Hun
    -0.06
    POSITIVE LOGITS
     внеш
    0.07
    _reserve
    0.06
    /raw
    0.06
     grandes
    0.06
    [][
    0.06
     junior
    0.06
    .grad
    0.06
     surprises
    0.06
    ({},
    0.06
    /Object
    0.06
    Act Density 0.000%

    No Known Activations