INDEX
    Explanations

    video-related terms and cues to prompt viewers to watch

    references to video content and sharing actions

    NEGATIVE LOGITS
    " recl"              -0.66
    " reconciliation"    -0.63
    " libel"             -0.60
    " academia"          -0.60
    "Ͻ"                  -0.60
    "umped"              -0.58
    "sole"               -0.58
    " Galile"            -0.57
    " diss"              -0.57
    " Sap"               -0.57
    POSITIVE LOGITS
    " Videos"            1.07
    " Thumbnails"        1.04
    " WATCHED"           0.89
    " âĢº"               0.84
    " VIDEOS"            0.83
    "iques"              0.82
    "icult"              0.78
    "Video"              0.76
    " Loading"           0.71
    " Video"             0.70
    Activation Density: 0.008%

    No Known Activations