data-describe

data⎰describe: Pythonic EDA Accelerator for Data Science

Showing:

Popularity

Downloads/wk

0

GitHub Stars

285

Maintenance

Last Commit

7mos ago

Contributors

14

Package

Dependencies

31

License

Apache-2.0

Categories

Readme

PyPI status PyPI license Downloads

PyPI version shields.io PyPI pyversions codecov

data ⎰ describe

data-describe is a Python toolkit for Exploratory Data Analysis (EDA). It aims to accelerate data exploration and analysis by providing automated and polished analysis widgets.

For more examples of data-describe in action, see the Quick Start Tutorial.

Main Features

data-describe implements the following basic features:

FeatureDescription
Data SummaryCurated data summary
Data HeatmapData variation and missingness heatmap
Correlation MatrixCorrelation heatmaps with categorical support
Distribution PlotsGenerate histograms, violin plots, bar charts
ScatterplotsGenerate scatterplots and evaluate with scatterplot diagnostics
Cluster AnalysisAutomated clustering and plotting
Feature RankingEvaluate feature importance using tree models

Extended Features

data-describe is always looking to elevate the standard for Exploratory Data Analysis. Here are just a few that are implemented:

  • Dimensionality Reduction Methods
  • Sensitive Data (PII) Redaction
  • Text Pre-processing / Topic Modeling
  • Big Data Support

Installation

data-describe can be installed using pip:

pip install data-describe

Getting Started

import data_describe as dd
help(dd)

See the User Guide for more information.

Project Status

data-describe is currently in beta status.

Contributing

data-describe welcomes contributions from the community.

Rate & Review

Great Documentation0
Easy to Use0
Performant0
Highly Customizable0
Bleeding Edge0
Responsive Maintainers0
Poor Documentation0
Hard to Use0
Slow0
Buggy0
Abandoned0
Unwelcoming Community0
100