Author: Niklas Baumstark
dryscrape is a lightweight web scraping library for Python. It uses a headless Webkit instance to evaluate Javascript on the visited pages. This enables painless scraping of plain web pages as well as Javascript-heavy “Web 2.0” applications like Facebook.
It is built on the shoulders of capybara-webkit's webkit-server. A big thanks goes to thoughtbot, inc. for building this excellent piece of software!
dryscrape.start_xvfb()
to
easily start Xvfb.headers
function in
a backwards-incompatible way: It now returns a list of (key, value)
pairs instead of a dictionary.The library has been confirmed to work on the following platforms:
Other unixoid systems should work just fine.
Windows is not officially supported, although dryscrape should work with cygwin.
Documentation can be found at dryscrape's ReadTheDocs page.
Quick installation instruction:
# pip install dryscrape
If you have any problems with this software, don't hesitate to open an issue on Github or open a pull request or write a mail to niklas baumstark at Gmail.
Version | Tag | Published |
---|---|---|
1.0 | 7yrs ago | |
0.9.1 | 7yrs ago |