SUMMARY: The purpose of this project is to practice web scraping by extracting specific pieces of information from a website. The Python web scraping code leverages the Selenium module.
INTRODUCTION: Real Money is a website dedicated to investment news and blog articles written by financial professionals. The website features numerous professionals with various trading specialties and expertise. The script automatically traverses the news listing for a site contributor and captures the high-level metadata of his/her blogs by storing them in a CSV output file.
Starting URLs: https://realmoney.thestreet.com/author/269/jim-cramer/all.html
The source code and HTML output can be found here on GitHub.
You must be logged in to post a comment.