INDEX
    Explanations

    Avoiding the word "description"

    New Auto-Interp
    Negative Logits
    wijs
    -0.08
     licences
    -0.08
     subsidi
    -0.08
     Lizenz
    -0.08
     ফের
    -0.08
     frying
    -0.08
     rech
    -0.08
    党委
    -0.07
     lice
    -0.07
    ыны
    -0.07
    POSITIVE LOGITS
     अंत
    0.08
     गर्द
    0.07
     depiction
    0.07
     embroidery
    0.07
     aesthetically
    0.07
    'or
    0.07
     portrays
    0.07
     वार
    0.07
     monoch
    0.07
    ात्मक
    0.07
    Act Density 0.004%

    No Known Activations