A lightweight, opinionated ETL framework, halfway between plain scripts and Apache Airflow. GitHub: mara/mara-pipelines
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization, etc. It also comes with Hadoop support built in.
Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code. GitHub: twitter/elephant-bird