gp

git-pandas

A wrapper around gitpython to produce pandas dataframes for analysis

Showing:

Popularity

Downloads/wk

0

GitHub Stars

174

Maintenance

Last Commit

2yrs ago

Contributors

5

Package

Dependencies

0

License

BSD

Categories

Readme

Git-Pandas

license Coverage Status travis status PyPI version downloads

Cumulative Blame

A simple set of wrappers around gitpython for creating pandas dataframes out of git data. The project is centered around two primary objects:

  • Repository
  • ProjectDirectory

A Repository object contains a single git repo, and is used to interact with it. A ProjectDirectory references a directory in your filesystem which may have in it multiple git repositories. The subdirectories are all walked to find any child repos, and any analysis is aggregated up from all of those into a single output (pandas dataframe).

Current functionality includes:

  • Commit history with extension and directory filtering
  • Edited files history with extension and directory filtering
  • Blame with extension and directory filtering
  • Branches
  • Tags
  • ProjectDirectory-level general information table
  • Approximate bus factor
  • Cumulative Blame as a time series
  • Github.com profile analysis via GitHubProfile object
  • Plotting helpers in utilities module
  • Punchcard dataframe and plotting utility
  • Filewise blame
  • File owner approximation
  • Estimation of hours spent per project or per author across projects

Please see examples for more detailed usage. The image above is generated using the repository object's cumulative blame function on stravalib.

Many of these functions are fairly intensive to run, so we also now support optional memory or redis based caching of certain methods.

Installation

Git-pandas supports python 2.7+ and 3.3+. To install use:

pip install git-pandas

Documentation

Docs can be found here: http://wdm0006.github.io/git-pandas/

Contributing

We are looking for contributors, so if you are interested, please review our contributor guidelines in CONTRIBUTING.md, which includes some proposed starter issues, or if you have an idea of your own, send us a pull request.

Projects Using Git-Pandas

License

This is BSD licensed (see LICENSE.md)

Rate & Review

Great Documentation0
Easy to Use0
Performant0
Highly Customizable0
Bleeding Edge0
Responsive Maintainers0
Poor Documentation0
Hard to Use0
Slow0
Buggy0
Abandoned0
Unwelcoming Community0
100
No reviews found
Be the first to rate

Alternatives

No alternatives found

Tutorials

No tutorials found
Add a tutorial