w3lib

Python library of web-related functions

Showing:

Popularity

Downloads/wk

0

GitHub Stars

336

Maintenance

Last Commit

1mo ago

Contributors

36

Package

Dependencies

1

License

BSD

Categories

Readme

=====

w3lib

.. image:: https://secure.travis-ci.org/scrapy/w3lib.png?branch=master :target: http://travis-ci.org/scrapy/w3lib

.. image:: https://img.shields.io/codecov/c/github/scrapy/w3lib/master.svg :target: http://codecov.io/github/scrapy/w3lib?branch=master :alt: Coverage report

Overview

This is a Python library of web-related functions, such as:

  • remove comments, or tags from HTML snippets
  • extract base url from HTML snippets
  • translate entites on HTML strings
  • convert raw HTTP headers to dicts and vice-versa
  • construct HTTP auth header
  • converting HTML pages to unicode
  • sanitize urls (like browsers do)
  • extract arguments from urls

Requirements

Python 3.6+

Install

pip install w3lib

Documentation

See http://w3lib.readthedocs.org/

License

The w3lib library is licensed under the BSD license.

Rate & Review

Great Documentation0
Easy to Use0
Performant0
Highly Customizable0
Bleeding Edge0
Responsive Maintainers0
Poor Documentation0
Hard to Use0
Slow0
Buggy0
Abandoned0
Unwelcoming Community0
100