Azkaban

Azkaban

Azkaban is a sophisticated workflow job scheduler tailored for managing Hadoop jobs by resolving job dependencies. It features an intuitive web interface for monitoring workflows and supports both solo-server and distributed modes, making it versatile for experimentation or scaling in production environments, particularly for ETL and data analytics tasks.

Top Azkaban Alternatives

Ad
StackScan

StackScan

Build targeted website lists by filtering domains based on the technologies they use. 50,000+ technologies across millions of domains.

StackScan Pte Ltd
1

Arcion

Arcion is a robust data pipeline software that enables users to deploy production-ready change data capture pipelines swiftly and effortlessly.

By: Arcion Labs From United States
2

Chalk

Chalk streamlines complex data workflows with its intuitive Python interface, enabling developers to effortlessly manage real-time data retrieval and sophisticated pipelines.

By: Chalk From United States
3

Datazoom

A robust video data platform, Datazoom integrates real-time data collection from various endpoints, including CDNs and media players.

By: Datazoom From United States
4

Data Taps

Data Taps is a sophisticated data pipeline software that facilitates real-time streaming analytics.

By: BoilingData From Finland
5

Adele

Adele streamlines the migration of SQL and ETL jobs to any cloud platform with precision and speed.

By: Adastra From Canada
6

Datastreamer

Datastreamer transforms web data workflows for organizations by automating data integration and real-time enrichment.

By: Datastreamer From Canada
7

Kestra

With its declarative YAML interface, users can build reliable workflows while managing data operations seamlessly...

By: Kestra From France
8

definity

It optimizes performance and minimizes costs by pinpointing waste, ensuring pipeline SLAs, and providing deep...

By: definity From United States
9

Meltano

With an extensive library of over 600 connectors, it allows seamless integration of databases, SaaS...

By: Meltano From United States
10

Dropbase

It allows teams to centralize offline data, import and clean files, and seamlessly export to...

By: Dropbase (YC W20) From United States
11

Nextflow

Its intuitive DSL streamlines the development and execution of complex workflows on cloud and cluster...

By: Seqera Labs From Spain
12

Key Ward

It automates data pipelines for machine learning and deep learning, enabling users to centralize, clean...

By: Key Ward From Germany
13

Prefect

Prefect Cloud serves as a command center, providing real-time monitoring, advanced scheduling, and customizable alerts...

By: Prefect From United States
14

GlassFlow

It processes up to 6M events/sec with minimal latency, integrating seamlessly with various data sources...

By: GlassFlow From Germany
15

Integrate.io

With over 220 transformations and 60-second CDC replication, it empowers both technical and non-technical users...

By: Integrate.io From United States

Top Azkaban Features

  • Batch workflow job scheduling
  • Job dependency resolution
  • User-friendly web interface
  • Solo-server mode for testing
  • Distributed multiple-executor mode
  • Embedded H2 database option
  • MySQL instance support
  • Master-slave database architecture
  • Robust production environment
  • Scalable multi-host setup
  • Easy workflow maintenance
  • Workflow tracking capabilities
  • ETL job automation
  • Data analytics job scheduling
  • Customizable job settings
  • Real-time job monitoring
  • Error handling and notifications
  • Job prioritization options
  • CLI for automation
  • Support for various plugins.