    Negative Logits
    Token               Logit
    "Siz"               -0.75
    " practicar"        -0.73
    " soared"           -0.68
    "beds"              -0.68
    "Trent"             -0.67
    " Interrupt"        -0.66
    "ณะ"                -0.66
    "rua"               -0.65
    "Mutagenicity"      -0.65
    (blank)             -0.65
    Positive Logits
    Token               Logit
    " Junk"             0.67
    " Goes"             0.65
    " sprawia"          0.64
    (blank)             0.64
    " wine"             0.63
    " Updates"          0.63
    " године"           0.62
    " connectivity"     0.62
    " igual"            0.62
    "idata"             0.61
    Activation Density: 0.067%

    No Known Activations