Carnegie Mellon University

Electrical and Computer Engineering

College of Engineering

Course Information

18-656SV: Data Intensive Workflow Development for Software Engineers




Many software systems have become increasingly data-intensive and data-centered. Manipulating comprehensive datasets and heterogeneous data sources typically requires composing and executing a series of computational or data manipulation steps, called a workflow. A data-oriented workflow is a formal way of defining, automating, repeating, and adapting multi-step computational procedures driven by data events. The primary objective of the course is to help students develop skills in engineering data-oriented workflows in the context of service-oriented software engineering, big data, cloud computing, the Internet of Things, social networking, and mobile computing. Core topics include data-oriented workflow theory, models, languages, techniques, architectures, systems, and tools; workflow discovery, reuse, recommendation, orchestration, and choreography; workflow properties and data dependencies; data provenance capture, storage, retrieval, and mining; workflow execution, allocation, and optimization in the cloud; and workflow as a service and collaborative data analytics on the Internet. Literature surveys and the study of state-of-the-art technologies, together with individual and group project work, are essential components of this class. Research and practical projects build upon one another.

Please note that this course is intended for ECE master's students with a concentration in Software Engineering and will satisfy the “Systems” area requirement.

Corequisites: 18-655

Last Modified: 2018-05-03 4:30PM

Current session:

This course is currently being offered.

Semesters offered:

  • Fall 2018
  • Fall 2017
  • Fall 2016
  • Fall 2015