Explanations
video titles containing the phrase "MUST WATCH."
the phrase "MUST WATCH" and related calls to action regarding viewing content
Negative Logits
Token       Logit
urally      -0.71
academia    -0.70
bats        -0.70
sucker      -0.67
ura         -0.67
jury        -0.66
pulp        -0.66
Dee         -0.66
nowhere     -0.66
utra        -0.64
Positive Logits
Token       Logit
WATCH       1.24
Thumbnails  0.97
VIDEOS      0.97
WATCHED     0.93
Watching    0.88
FILE        0.82
]}          0.80
IFF         0.80
CLASSIFIED  0.79
LIST        0.79
Activations Density: 0.006%