INDEX
    Explanations

    wording related to opt-in promotions or advertisements

    terms related to promotional content, offers, and updates

    New Auto-Interp
    Negative Logits
    closed
    -0.71
    mand
    -0.69
    lly
    -0.69
    urally
    -0.68
    geist
    -0.67
    Registered
    -0.64
    ograp
    -0.62
    stood
    -0.60
     Kamp
    -0.60
    agate
    -0.59
    POSITIVE LOGITS
    ctory
    0.83
    pring
    0.80
     occasional
    0.78
    ource
    0.72
     Schwarzenegger
    0.70
    piracy
    0.69
    poons
    0.67
    Subscribe
    0.67
    hips
    0.65
    ause
    0.64
    Act Density 0.020%

    No Known Activations