    Explanations

    negative statements or criticisms

    negative constructions and expressions of prohibition

    Negative Logits
     biod                 -0.69
    soDeliveryDate        -0.68
     PD                   -0.66
    quickShipAvailable    -0.65
     LIN                  -0.65
     Wid                  -0.64
    ゼウス                -0.62
     Bonds                -0.62
     veins                -0.61
     Valkyrie             -0.61
    Positive Logits
     emulate               0.97
    erers                  0.81
    iners                  0.72
    umbn                   0.72
    regnancy               0.72
     imitate               0.71
     apologize             0.70
     aspire                0.70
     behave                0.70
    icably                 0.70
    Activation Density: 0.057%

    No Known Activations