A lightweight opinionated etl framework halfway between plain scripts and apache airflow git hub mara mara pipelines a lightweight opinionated etl framework halfway between plain scripts and
Luigi is a python module that helps you build complex pipelines of batch jobs it handles dependency resolution workflow management visualization etc it also comes with hadoop support built in
Spring xd makes it easy to solve common big data problems such as data ingestion and export real time analytics and batch workflow orchestration git hub spring projects spring xd spring xd ma
Twitter 39 s collection of lzo and protocol buffer related hadoop pig hive and h base code git hub twitter elephant bird twitter 39 s collection of lzo and protocol buffer related hadoop
Subscribe to get resources directly to your inbox. You won't receive any spam! ✌️