Michael Scharnagl 1/4/2019

Using Puppeteer to crawl pages and save them as Markdown files

This article describes how to migrate a WordPress site to a static site generator with Puppeteer, a Node.js library for controlling Chrome programmatically. The approach is to crawl the site's pages, extract the article content from the DOM, and save it as Markdown files. It includes code examples for launching a browser, navigating to URLs, and handling errors.
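The steps in that summary (launch a browser, navigate to each URL, read the article from the DOM, write a Markdown file, handle errors) can be sketched roughly as follows. This is a minimal illustration, not the original article's code: the `article h1` and `article p` selectors, the helper names `fileNameFromUrl`, `toMarkdown`, and `crawl`, and the output layout are all assumptions, and the sketch only keeps paragraph text rather than converting the full HTML to Markdown.

```javascript
const fs = require('fs/promises');
const path = require('path');

// Hypothetical helper: turn a URL such as https://example.com/2018/my-post/
// into a file name such as my-post.md (falls back to index.md for the root).
function fileNameFromUrl(url) {
  const segments = new URL(url).pathname.split('/').filter(Boolean);
  const slug = segments.length > 0 ? segments[segments.length - 1] : 'index';
  return `${slug}.md`;
}

// Hypothetical helper: build a Markdown document from the extracted data,
// a level-one heading followed by one paragraph per entry.
function toMarkdown({ title, paragraphs }) {
  return [`# ${title}`, ...paragraphs].join('\n\n') + '\n';
}

async function crawl(urls, outDir) {
  // Required here so the helpers above work even without Puppeteer installed.
  const puppeteer = require('puppeteer');
  const browser = await puppeteer.launch();
  try {
    const page = await browser.newPage();
    await fs.mkdir(outDir, { recursive: true });

    for (const url of urls) {
      try {
        await page.goto(url, { waitUntil: 'networkidle2' });

        // Runs inside the page. The selectors are assumptions and need to be
        // adjusted to the markup of the site being crawled.
        const article = await page.evaluate(() => ({
          title: document.querySelector('article h1').textContent.trim(),
          paragraphs: Array.from(
            document.querySelectorAll('article p'),
            (p) => p.textContent.trim()
          ),
        }));

        const target = path.join(outDir, fileNameFromUrl(url));
        await fs.writeFile(target, toMarkdown(article));
      } catch (error) {
        // One failing page should not stop the rest of the crawl.
        console.error(`Failed to save ${url}: ${error.message}`);
      }
    }
  } finally {
    // Close the browser even if something above throws.
    await browser.close();
  }
}
```

Calling `crawl(['https://example.com/2018/my-post/'], './posts')` would, under these assumptions, write `./posts/my-post.md`. Keeping the file-name and Markdown helpers separate from the browser code makes them easy to check without launching Chrome.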
