Apache Oozie Workflow Scheduler for Hadoop

Overview

Oozie is a workflow scheduler system to manage Apache Hadoop jobs.

Oozie Workflow jobs are Directed Acyclic Graphs (DAGs) of actions.
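
To make the DAG idea concrete, here is a minimal Python sketch. It does not use Oozie's own workflow definition format or API. The action names and the execution_order helper are hypothetical, chosen only to show that a workflow whose actions form a DAG always has a valid run order, while a cyclic one does not.

```python
from graphlib import TopologicalSorter, CycleError

# Hypothetical workflow: each action maps to the set of actions it depends on.
# The two aggregate actions form a fork after "clean-data" and join again
# at "export-report".
workflow = {
    "import-data": set(),
    "clean-data": {"import-data"},
    "pig-aggregate": {"clean-data"},
    "hive-aggregate": {"clean-data"},
    "export-report": {"pig-aggregate", "hive-aggregate"},
}


def execution_order(actions):
    """Return one valid run order, or raise ValueError if there is a cycle."""
    try:
        return list(TopologicalSorter(actions).static_order())
    except CycleError as err:
        # err.args[1] holds the list of nodes that form the cycle.
        raise ValueError(f"not a DAG, cycle found: {err.args[1]}") from err


if __name__ == "__main__":
    print(execution_order(workflow))
```

Every action appears after all of the actions it depends on, and a definition such as {"a": {"b"}, "b": {"a"}} is rejected because it contains a cycle.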

Oozie Coordinator jobs are recurrent Oozie Workflow jobs triggered by time (frequency) and data availability.
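
The two triggers, time and data availability, can be illustrated with a toy Python model. This is only a sketch under stated assumptions, not Oozie's coordinator implementation or API: nominal_times and ready_to_run are hypothetical helpers, and data availability is reduced to a set of timestamps for which input is assumed to exist.

```python
from datetime import datetime, timedelta


def nominal_times(start, end, frequency):
    """Yield each scheduled time in [start, end), spaced by `frequency`."""
    current = start
    while current < end:
        yield current
        current += frequency


def ready_to_run(nominal_time, now, available_inputs):
    """A run is ready once its time has come and its input data exists."""
    return nominal_time <= now and nominal_time in available_inputs


start = datetime(2024, 1, 1, 0, 0)
end = datetime(2024, 1, 1, 4, 0)
times = list(nominal_times(start, end, timedelta(hours=1)))

# Assumption for illustration: input data has landed for the first two hours only.
available = {times[0], times[1]}
now = datetime(2024, 1, 1, 2, 30)

ready = [t for t in times if ready_to_run(t, now, available)]

if __name__ == "__main__":
    for t in ready:
        print("would trigger workflow for", t.isoformat())
```

In this model the 02:00 run is due by time but stays pending because its input has not arrived, and the 03:00 run is not yet due.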

Oozie is integrated with the rest of the Hadoop stack, supporting several types of Hadoop jobs out of the box (such as Java map-reduce, Streaming map-reduce, Pig, Hive, Sqoop and DistCp) as well as system-specific jobs (such as Java programs and shell scripts).

Oozie is a scalable, reliable and extensible system.

Developers interested in getting more involved with Oozie may join the mailing lists, report bugs, retrieve code from the version control system, and make contributions.
