blog header image - picture of Josh

Python Blogger Post Collector

Published on .

I worked as an undergraduate research assistant in Dr. Arlen Moller’s Motivation and Health Psychology Lab for a semester. This summer, Dr. Moller reached out to me about a project he was planning that involved analyzing the contents of a specific blog. The blog in question is on blogspot.com, which hosts blogs built with Google’s Blogger service.

As far as I could tell, there was no straightforward way to pull all of the posts and comments off of a Blogger site. So, I wrote a Blogger collector in Python. You can find my source code [here](https://github.com/jbguberman/blogger_scraper). The repo has a readme that I think should make using the collector pretty straightforward. If you have any questions, though, please feel free to leave a comment here or to open an issue on GitHub.

(Disclaimer: The current incarnation of the script may or may not let you rotate between multiple API keys, which would let you avoid running into API rate limits. If it does, comment out that line. The script shouldn’t let you do that, and it’s probably against Google’s TOS.)
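
For anyone curious what collecting posts and comments through the Blogger API can look like, here is a minimal sketch. It is not the code from the repo. It assumes the public Blogger API v3 endpoints for listing posts and comments, a single API key, and a numeric blog ID you already know. The names `collect_items`, `collect_blog`, and the `collected_comments` field are made up for illustration.

```python
import json
import urllib.parse
import urllib.request

# Assumption: the public Blogger API v3 root, not necessarily what the repo uses.
API_ROOT = "https://www.googleapis.com/blogger/v3"


def fetch_json(url):
    """Download a URL and decode its JSON body."""
    with urllib.request.urlopen(url) as response:
        return json.load(response)


def collect_items(base_url, api_key, fetch=fetch_json):
    """Gather every item from a paginated listing by following nextPageToken."""
    items = []
    page_token = None
    while True:
        params = {"key": api_key}
        if page_token:
            params["pageToken"] = page_token
        page = fetch(base_url + "?" + urllib.parse.urlencode(params))
        items.extend(page.get("items", []))
        page_token = page.get("nextPageToken")
        if not page_token:
            return items


def collect_blog(blog_id, api_key, fetch=fetch_json):
    """Return all posts for a blog, each with its comments attached."""
    posts = collect_items(f"{API_ROOT}/blogs/{blog_id}/posts", api_key, fetch)
    for post in posts:
        comments_url = f"{API_ROOT}/blogs/{blog_id}/posts/{post['id']}/comments"
        # "collected_comments" is a hypothetical field name, not part of the API.
        post["collected_comments"] = collect_items(comments_url, api_key, fetch)
    return posts
```

Calling `collect_blog("YOUR_BLOG_ID", "YOUR_API_KEY")` would make one request per page of posts plus at least one per post for comments, so a large blog can use up a fair amount of quota. The `fetch` parameter is only there so the paging logic can be tried out without touching the network.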