INDEX
    Explanations

    phrases related to instructions and user engagement in promotional content

    New Auto-Interp
    Negative Logits
     Monfieur
    -1.03
     Efq
    -1.01
     itſelf
    -0.99
     pleaſure
    -0.97
     propOrder
    -0.96
    IsContent
    -0.94
     Anſ
    -0.93
    tonsoft
    -0.93
     ModelExpression
    -0.93
     Houſe
    -0.90
    POSITIVE LOGITS
     Join
    0.75
     Get
    0.75
     Visit
    0.74
     Learn
    0.73
    Visit
    0.69
    Join
    0.68
     Discover
    0.67
     join
    0.67
     Find
    0.67
     Watch
    0.67
    Act Density 0.179%

    No Known Activations