    Explanations

    phrases indicating a call to action to watch a video

    references to videos and related multimedia content

    Negative Logits
    Token                  Logit
    " academia"            -0.74
    " Wonderland"          -0.71
    " reconciliation"      -0.68
    " Gibbs"               -0.68
    "REF"                  -0.66
    " Aberdeen"            -0.65
    " virtue"              -0.65
    "sole"                 -0.64
    " plagiar"             -0.64
    " Mole"                -0.64
    Positive Logits
    Token                  Logit
    " Thumbnails"          1.12
    " WATCHED"             1.08
    " Videos"              1.02
    " VIDEOS"              0.89
    "Video"                0.83
    "natureconservancy"    0.81
    " âĩ"                  0.81
    "icult"                0.80
    "iques"                0.79
    " âĢº"                 0.78
    Activation Density: 0.009%

    No Known Activations