INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    aglia
    -0.08
     knife
    -0.08
    (load
    -0.07
    (Register
    -0.07
    (for
    -0.07
     помогает
    -0.07
    (us
    -0.07
     livres
    -0.07
     Buchanan
    -0.07
    (REG
    -0.07
    POSITIVE LOGITS
     dropout
    0.08
     creen
    0.08
     percaya
    0.08
     oran
    0.08
     autoplay
    0.08
    -opacity
    0.08
     gain
    0.08
    átu
    0.08
    0.08
    .opacity
    0.08
    Act Density 0.001%

    No Known Activations