SUMMARY: This project aims to practice web scraping by extracting specific pieces of information from a website. The web scraping Python code leverages the BeautifulSoup module.
Dr. Jason Brownlee’s Machine Learning Mastery hosts its tutorial lessons at https://machinelearningmastery.com/blog. The purpose of this exercise is to practice web scraping by gathering the blog entries from Machine Learning Mastery’s web pages. This iteration of the script automatically traverses the web pages to capture all articles and store the captured information in a CSV output file for sorting and filtering.
Starting URL: https://machinelearningmastery.com/blog
The source code and HTML output can be found here on GitHub.
You must be logged in to post a comment.