.. image:: https://readthedocs.org/projects/scrapelib/badge/?version=latest
    :target: https://readthedocs.org/projects/scrapelib/?badge=latest
    :alt: Documentation Status

scrapelib is a library for making requests to less-than-reliable websites. It is implemented
(as of 0.7) as a wrapper around `requests <http://python-requests.org>`_.

scrapelib originated as part of the
`Open States <http://openstates.org/>`_
project to scrape the websites of all 50 state legislatures. As a result, it
was designed with features desirable when dealing with sites that
have intermittent errors or require rate-limiting.

Advantages of using scrapelib over alternatives like httplib2 or simply using requests as-is:

* Built on the `requests <http://python-requests.org>`_ library.

Written by James Turk email@example.com, with thanks to Michael Stephens for the initial urllib2/httplib2 version.

See https://github.com/jamesturk/scrapelib/graphs/contributors for contributors.

::

    import scrapelib
    s = scrapelib.Scraper(requests_per_minute=10)

    # Requests are throttled to 10 per minute
    while True:
        s.get('http://example.com')
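
To make the rate-limiting and retry behaviour described above more concrete, here is a minimal, standard-library-only sketch of the general idea. It is not scrapelib's actual implementation: ``Throttle`` and ``fetch_with_retry`` are hypothetical names chosen for illustration, and treating ``OSError`` as the retryable failure is an assumption.

```python
import time


class Throttle:
    """Hypothetical helper: space calls so at most `per_minute` occur per minute."""

    def __init__(self, per_minute, clock=time.monotonic, sleep=time.sleep):
        self.interval = 60.0 / per_minute  # minimum seconds between calls
        self.clock = clock                 # injectable for testing
        self.sleep = sleep                 # injectable for testing
        self.last = None                   # time of the previous call, if any

    def wait(self):
        """Block until the next call is allowed, then record its time."""
        now = self.clock()
        if self.last is not None:
            remaining = self.interval - (now - self.last)
            if remaining > 0:
                self.sleep(remaining)
                now = self.clock()
        self.last = now


def fetch_with_retry(fetch, url, throttle, attempts=3):
    """Call fetch(url) under the throttle, retrying when it raises OSError.

    `fetch` is any callable taking a URL (an assumption for this sketch).
    The last failure is re-raised once `attempts` tries are used up.
    """
    for attempt in range(attempts):
        throttle.wait()
        try:
            return fetch(url)
        except OSError:
            if attempt == attempts - 1:
                raise
```

With ``Throttle(10)``, consecutive calls end up at least six seconds apart, which corresponds to the ``requests_per_minute=10`` setting in the example above.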