INDEX
    Explanations

    phrases directing users to leave comments or interact below a given content

    references to comments or interactions in the comment section of a document

    New Auto-Interp
    Negative Logits
    Freedom
    -0.71
    uay
    -0.70
    Sense
    -0.70
    olly
    -0.69
    eg
    -0.67
    ggle
    -0.65
     CTR
    -0.64
    Choice
    -0.64
    Ring
    -0.64
    ãĤ¸
    -0.63
    POSITIVE LOGITS
     below
    0.97
     veter
    0.93
    below
    0.83
     plateau
    0.81
     eleph
    0.76
     tradem
    0.75
    estyles
    0.73
    hirt
    0.72
     carbohyd
    0.72
     crest
    0.72
    Act Density 0.017%

    No Known Activations