sayn

Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).

SAYN is a modern data processing and modelling framework. Users define tasks (Python scripts, automated SQL transformations and more) and the relationships between them; SAYN takes care of the rest. It is designed for simplicity, flexibility and centralisation, with the aim of making the data engineering workflow more efficient.
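To make "tasks and the relationships between them" concrete, the sketch below orders a small set of tasks so that each one runs after the tasks it depends on. It uses only Python's standard `graphlib` module and is not SAYN's own code or API; the task names and the `execution_order` helper are hypothetical and chosen for illustration.

```python
from graphlib import TopologicalSorter

# Hypothetical tasks: each name maps to the set of tasks it depends on.
tasks = {
    "extract_events": set(),
    "model_sessions": {"extract_events"},
    "campaign_roi": {"model_sessions"},
    "train_model": {"model_sessions"},
}


def execution_order(dependencies):
    """Return one valid order in which every task follows its parents."""
    # static_order() raises graphlib.CycleError if the graph has a cycle.
    return list(TopologicalSorter(dependencies).static_order())


order = execution_order(tasks)
print(order)
```

Several orders can be valid for the same graph (here the last two tasks are interchangeable), so the example only relies on parents appearing before their children.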

Use Cases

SAYN can be used for multiple purposes across the data engineering and analytics workflows:

  • Data extraction: complement tools such as Fivetran or Stitch with customised extraction processes.
  • Data modelling: transform raw data in your data warehouse (e.g. aggregate activity or sessions, calculate marketing campaign ROI, etc.).
  • Data science: integrate and execute data science models.

Key Features

SAYN has the following key features:

  • YAML-based DAG (Directed Acyclic Graph) creation. This means all analysts, including those who are not proficient in Python, can easily add tasks to ETL processes with SAYN.
  • Automated SQL transformations: write your SELECT statement. SAYN turns it into a table/view and manages everything for you.
  • Jinja parameters: switch easily between development and production environments, among other tricks, with Jinja templating.
  • Python tasks: use Python scripts to complement your extraction and loading layer and build data science models.
  • Multiple databases supported.
  • and much more... See the Documentation.
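The idea behind the automated SQL transformations listed above, where a SELECT statement is turned into a table or view, can be sketched with Python's built-in `sqlite3` module. This is a conceptual illustration only, not how SAYN implements the feature; the `materialise` helper, the `events` table and the `user_totals` name are all hypothetical.

```python
import sqlite3


def materialise(conn, name, select_sql, as_view=False):
    """Store the result of a SELECT statement as a table or a view."""
    kind = "VIEW" if as_view else "TABLE"
    # Names are interpolated directly, so only pass trusted identifiers.
    # Dropping the previous version first makes the step safe to re-run.
    conn.execute(f"DROP {kind} IF EXISTS {name}")
    conn.execute(f"CREATE {kind} {name} AS {select_sql}")


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE events (user_id INTEGER, amount REAL)")
conn.executemany(
    "INSERT INTO events VALUES (?, ?)",
    [(1, 10.0), (1, 5.0), (2, 7.5)],
)

# The analyst only writes the SELECT; the helper handles the DDL around it.
materialise(
    conn,
    "user_totals",
    "SELECT user_id, SUM(amount) AS total FROM events GROUP BY user_id",
)

rows = conn.execute(
    "SELECT user_id, total FROM user_totals ORDER BY user_id"
).fetchall()
print(rows)
```

Passing `as_view=True` creates a view instead, which recomputes the query each time it is read rather than storing the rows.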

Design Principles

SAYN aims to empower data engineers and analysts through its three core design principles:

  • Simplicity: data processes should be easy to create, scale and maintain, so that your team can focus on data transformation instead of writing processes. SAYN orchestrates all your tasks systematically and provides many automation features.
  • Flexibility: the power of data is unlimited, and so should be the power of your tooling. SAYN supports both SQL and Python so your analysts can choose the best solution for each process.
  • Centralisation: all analytics code should live in one place, making your life easier and allowing dependencies throughout the whole analytics process.

Quick Start

$ pip install sayn
$ sayn init test_sayn
$ cd test_sayn
$ sayn run

That's it! You have completed your first SAYN run on the example project. Continue with the Tutorial: Part 1, which gives a good overview of what SAYN can do.

Release Updates

If you want to receive update emails about SAYN releases, you can sign up here.

Support

If you need any help with SAYN, or simply want to know more, please contact the team at sayn@173tech.com.

License

SAYN is open source under the Apache 2.0 license.


Made with ❤️ by 173tech.
