INDEX
    Explanations

    phrases related to subscribing to newsletters

    instances of the word "we" and related phrases that signify collective action or communication

    Negative Logits
    " Kand"        -0.56
    " Ukrain"      -0.56
    " Emer"        -0.53
    " Gentle"      -0.51
    " Eleven"      -0.50
    " Dunham"      -0.50
    " Dull"        -0.48
    " partName"    -0.48
    "naire"        -0.48
    " Lean"        -0.48
    Positive Logits
    "'ve"          0.61
    "Have"         0.61
    "astics"       0.59
    "'re"          0.59
    "ighed"        0.56
    "atered"       0.55
    "asel"         0.54
    "Movie"        0.53
    "Got"          0.53
    "bsp"          0.52
    Activation Density: 0.013%

    No Known Activations