2011-03-23 by Stefan Urbanek

Introduction

Freshly brewed clean data with analytical taste – that is what Data Brewery is for. The Python framework will allow you to:

  • stream structured data from various sources (CSV, XLS, SQL database, Google spreadsheet) to various structured targets
  • create analytical streams using flow-based programming: connect processing nodes together and let the structured data flow through them
  • measure data properties, such as data quality or numerical statistics
  • do advanced data mining in the future such as clustering or classification

You can use Brewery for analytical automation or just for ad-hoc analytical processing.

Project page is at databrewery.org. Source repository can be found at:

Documentation with examples and node reference can be found here.

Happy brewing!

Brewery stream example