    Negative Logits
     appropr  -0.09
     bede     -0.08
     ah       -0.08
     Nero     -0.08
    Assoc     -0.08
    PC        -0.07
    pec       -0.07
     alk      -0.07
    corn      -0.07
     téh      -0.07
    Positive Logits
     siquiera     0.12
     tampoco      0.12
     hinder       0.10
     nor          0.10
    /or           0.08
     locality     0.08
    ebook         0.08
     intrusive    0.08
     hence        0.08
    withstanding  0.08
    Activation Density: 0.007%

    No Known Activations