Spring Cloud Data Flow

Spring Cloud Data Flow is a cloud-native orchestration service for composable microservice applications on modern runtimes. With Spring Cloud Data Flow, developers can create and orchestrate data pipelines for common use cases such as data ingest, real-time analytics, and data import/export.

Quick Start
Fork me on GitHub

Spring Cloud Data Flow is the cloud native redesign of Spring XD – a project that aimed to simplify development of Big Data applications. The stream and batch modules from Spring XD are refactored as Spring Boot based stream and task data microservice applications respectively. These applications are now autonomous deployment units and they can "natively" run in modern runtimes such as Cloud Foundry, Apache YARN, Apache Mesos, and Kubernetes.

Spring Cloud Data Flow offers a collection of patterns and best practices for microservices-based distributed streaming and batch data pipelines.


  • Orchestrate applications across a variety of distributed modern runtimes including: Cloud Foundry, Apache YARN, Apache Mesos, and Kubernetes
  • Separate runtime dependencies backed by ‘spring profiles’
  • Consume streaming and batch data microservices as maven or docker artifacts
  • Create, unit-test, troubleshoot and manage data microservices in isolation
  • Develop using: DSL, Shell, REST-APIs, Dashboard, and Flo
  • Take advantage of metrics, health checks, and remote management at each data microservice level
  • Scale stream and batch pipelines without interrupting data flows

Quick Start

Step 1 - Clone Spring Cloud Data Flow repository

Step 2 - Start Kafka [used as: messaging middleware]

Step 3 - Build Project mvn clean package

Step 4 - Launch ‘Local’ Server [spring-cloud-dataflow-server-local/target] java -jar spring-cloud-dataflow-server-local-[VERSION].jar

Step 5 - Launch Shell [spring-cloud-dataflow-shell/target] java -jar spring-cloud-dataflow-shell-[VERSION].jar

Step 6 - Import all the out-of-the-box application coordinates in bulk dataflow:>app import --uri http://bit.ly/stream-applications-kafka-maven

Step 7 - Create ‘ticktock’ Stream dataflow:>stream create ticktock --definition "time | log" --deploy

You'll notice the following in ‘Local’ Server console.

2016-07-18 22:08:24.777  INFO 73058 --- [nio-9393-exec-9] o.s.c.d.spi.local.LocalAppDeployer       : deploying app ticktock.log instance 0
   Logs will be in /var/folders/c3/ctx7_rns6x30tq7rb76wzqwr0000gp/T/spring-cloud-dataflow-5011521526937452211/ticktock-1468904904769/ticktock.log
2016-07-18 22:08:25.081  INFO 73058 --- [nio-9393-exec-9] o.s.c.d.spi.local.LocalAppDeployer       : deploying app ticktock.time instance 0
   Logs will be in /var/folders/c3/ctx7_rns6x30tq7rb76wzqwr0000gp/T/spring-cloud-dataflow-5011521526937452211/ticktock-1468904905074/ticktock.time

Step 8 - Verify the ‘ticktocks’: tail -f /var/folders/ ... /ticktock.log/stdout_0.log

Step 9 - Launch Dashboard at: http://localhost:9393/dashboard

Spring Cloud Data Flow Server Implementations

Server Type Stable Release SNAPSHOT Release
Local Server 1.0.1.RELEASE[docs] 1.1.0.BUILD-SNAPSHOT[docs]
Cloud Foundry Server 1.0.0.RELEASE[docs] 1.0.1.BUILD-SNAPSHOT[docs]
Apache YARN Server 1.0.0.RELEASE[docs] 1.0.1.BUILD-SNAPSHOT[docs]
Kubernetes Server 1.0.0.RELEASE[docs] 1.0.1.BUILD-SNAPSHOT[docs]
Apache Mesos Server 1.0.0.RELEASE[docs] 1.0.1.BUILD-SNAPSHOT[docs]

Building Blocks of Spring Cloud Data Flow

Spring Cloud Data Flow builds upon several projects and the top-level building blocks of the ecosystem are listed in the following visual representation. Each project represents a core capability and they evolve in isolation, with separate release cadences - follow the links to find more details about each project.

Spring Cloud Data Flow Local Server
Spring Cloud Data Flow Cloud Foundry Server
Spring Cloud Data Flow Apache Yarn Server
Spring Cloud Data Flow Kubernetes Server
Spring Cloud Data Flow Apache Mesos Server

REST-APIs / Shell / DSL
Flo for Spring Cloud Data Flow
Spring Flo
Spring Cloud Data Flow - Core

↓     Uses     ↓

Spring Cloud Deployer - Service Provider Interface (SPI)

↑     Implements     ↑

Spring Cloud Deployer Local
Spring Cloud Deployer Cloud Foundry
Spring Cloud Deployer Yarn
Spring Cloud Deployer Kubernetes
Spring Cloud Deployer Mesos

↓     Deploys     ↓

Spring Cloud Stream App Starters
Spring Cloud Task App Starters
Spring Cloud Stream
Spring Cloud Task

↓     Uses     ↓

Spring Integration
Spring Boot
Spring Batch