data:image/s3,"s3://crabby-images/61492/61492a9b3396b08f150e997a1b9120798bc920d1" alt="Puppeteer type download"
We can combine the child process module with our Puppeteer script and download files in parallel. Child process is how Node.js handles parallel programming. We can fork multiple child_proces in Node. Our CPU cores can run multiple processes at the same time. 💡 Learn more about the single threaded architecture of node here
data:image/s3,"s3://crabby-images/65e13/65e131588ff286de05bef3f2749146631a790e07" alt="puppeteer type download puppeteer type download"
data:image/s3,"s3://crabby-images/a0110/a0110b41b6730287be0c294603614a732c63cc2d" alt="puppeteer type download puppeteer type download"
Therefore if we have to download 10 files each 1 gigabyte in size and each requiring about 3 mins to download then with a single process we will have to wait for 10 x 3 = 30 minutes for the task to finish. It can only execute one process at a time. You see Node.js in its core is a single-threaded system.
data:image/s3,"s3://crabby-images/1226e/1226e2c3c7fbfb36e0f5adf7058bb22b41569168" alt="puppeteer type download puppeteer type download"
However, if you have to download multiple large files things start to get complicated. In this next part, we will dive deep into some of the advanced concepts.
data:image/s3,"s3://crabby-images/61492/61492a9b3396b08f150e997a1b9120798bc920d1" alt="Puppeteer type download"