INDEX
    Explanations

    promotional messages or offers

    references to promotional content or marketing materials

    New Auto-Interp
    Negative Logits
    GO
    -0.87
    acht
    -0.76
     Ness
    -0.73
     Apostles
    -0.71
    ña
    -0.68
    yer
    -0.68
    ighed
    -0.68
    gger
    -0.66
    ectar
    -0.66
    kos
    -0.65
    POSITIVE LOGITS
     promotions
    1.00
     promotional
    0.99
    eatures
    0.88
     calendars
    0.84
     banners
    0.82
    andise
    0.81
     promotion
    0.80
     promo
    0.79
    wcs
    0.79
     broch
    0.78
    Act Density 0.015%

    No Known Activations