INDEX
    Explanations

    references to promotional offers or discounts

    New Auto-Interp
    Negative Logits
    arton
    -0.69
    -0.67
    owan
    -0.63
     amassed
    -0.61
     appar
    -0.60
    -0.59
    ellar
    -0.55
    wine
    -0.55
    orde
    -0.53
     upro
    -0.53
    POSITIVE LOGITS
    ering
    0.85
    spring
    0.85
    enses
    0.83
    ices
    0.79
    topic
    0.76
    hand
    0.75
    cial
    0.75
    endas
    0.74
    loading
    0.72
    ctr
    0.71
    Act Density 0.015%

    No Known Activations