A simple set of wrappers around gitpython for creating pandas dataframes out of git data. The project is centered around two primary objects:
A Repository object contains a single git repo, and is used to interact with it. A ProjectDirectory references a directory in your filesystem which may have in it multiple git repositories. The subdirectories are all walked to find any child repos, and any analysis is aggregated up from all of those into a single output (pandas dataframe).
Current functionality includes:
Please see examples for more detailed usage. The image above is generated using the repository object's cumulative blame function on stravalib.
Many of these functions are fairly intensive to run, so we also now support optional memory or redis based caching of certain methods.
Git-pandas supports python 2.7+ and 3.3+. To install use:
pip install git-pandas
Docs can be found here: http://wdm0006.github.io/git-pandas/
We are looking for contributors, so if you are interested, please review our contributor guidelines in CONTRIBUTING.md, which includes some proposed starter issues, or if you have an idea of your own, send us a pull request.
This is BSD licensed (see LICENSE.md)