Active'17

First International Workshop on Data Management on Virtualized Active Systems.

Objective

The objective of this one-day workshop is to bring researchers and practitioners together in order to investigate opportunities in exploiting virtualized active (compute-enabled) technologies such as active memory, active network, and active storage for accelerating data-intensive workloads. Furthermore, this workshop aims at investigating issues in realizing active capabilities (enabled by hardware accelerators such as SSDs, GPUs, FPGAs, and ASICs) in the entire system stack running on cloud. The workshop aims at providing a forum for academia and industry to exchange ideas through research and position papers.

Scope

The recent popularity of Big Data applications has renewed interest in developing and optimizing data-intensive workloads. In addition to the traditional scientific computing domain, Big Data phenomena are observed in a variety of new domains such as Internet of Things (IoT), social analytic, personalized health and precision medicine, bioinformatics, energy informatics, and emergency response and disaster management. The Big Data workloads are characterized by unprecedented volumes and velocity of data, both historic and transient, very high-speed data flows (e.g., from sensor networks), and diverse variety of data from completely unstructured text documents, to structured relational tables or matrices, to photos and videos. Analyzing vast quantities of such complex data (partially powered by machine learning) is becoming as important as the traditional massive-scale data management.

Unfortunately, existing approaches to solve data-intensive problems are woefully inadequate to address the challenges raised by the Big Data applications. Specifically, these approaches require data to be processed to be moved near the computing resources. These data movement costs can be prohibitive for large data sets such as those observed in the aforementioned workloads. One way to address this problem is to bring virtualized computing resources closer to data, whether it is at rest or in motion. The premise of "active" systems is a new holistic view of the system in which every data medium (whether volatile or non-volatile) and every communication channel becomes compute-enabled.

Although prototypes of systems with active technologies are currently available, there is a very limited exploitation of their capabilities in real-life problems. The proposed workshop aims to evaluate different aspects of the active systems stack and understand the impact of active technologies (including but not limited to hardware accelerators such as SSDs, GPUs, FPGAs, and ASICs) on different applications workloads. Specifically, the workshop aims to understand the role of modern hardware to enable active medium (whether network, storage, or memory) over the entire path and the lifecycle of data, especially as today's database system opt for hierarchies of storage and memory. Furthermore, we aim to revisit the interplay between algorithmic modeling, compiler and programming languages, virtualized runtime systems and environments, and hardware implementations, for effective exploitation of active technologies.

Topics of Interest

Topics of interest include but are not limited to:
  • Data Management Issues in Active Systems (e.g., active network, storage, and memory)
  • Data Management Issues in Software-Hardware-System Co-design
  • Active Technologies (e.g., SSDs, GPUs, FPGAs, and ASICs) in Co-design Architectures
  • Query Orchestration and Execution Models in Co-design Architectures
  • Enabling Partial Computation or Best Effort Computation in Co-design Architectures
  • Offloading Computation to Accelerators in Co-processor Design
  • Placing Accelerator on the Data Path in Co-placement Design
  • Programming Methodologies for Data-intensive Workloads on Active Technologies
  • Virtualizing Active Technologies on Cloud (e.g., Scalability and Security)
  • Exploiting Active Technologies in Modern Databases (e.g., NoSQL and NewSQL)
  • Extending Runtime of Big Data Systems (e.g., Spark, Hadoop) with Active Technologies
  • Autonomic Tuning for Data Management Workloads in Co-design Architectures
  • Algorithms and Performance Models for Active Memory and Storage Sub-systems
  • Novel Applications of Low-Power Modern Processors, GPUs, FPGAs, and ASICs
  • Novel Applications of Transactional Memory in Co-design Architectures
  • Workload-aware System Co-design for Emerging Applications (e.g., Internet-of-Things, Personalized Health, and Precision Medicine)

Program

Opening Remarks (8:45-9:00am)


Session 1: HardBD Keynote (9:00-10:00am)

  • Andrew Putnam, "The Configurable Cloud -- Accelerating Hyperscale Datacenter Services with FPGAs" [Slides]

Break (10:00-10:30am)


Session 2: Active Invited Academic Talks (10:30-12:00pm)

  • Xiaodong Zhang, "Enabling Effective Utilization of GPUs for Data Mamagement Systems" [Abstract, Slides]
  • Stratis D. Viglas, "Processing declarative queries through generating imperative code in managed runtimes" [Abstract, Slides]
  • Bingsheng He, "Data Management Systems on Future Hardware: Challenges and Opportunities" [Abstract, Slides]

Lunch (12:00-1:30pm)


Session 3: HardBD Papers (1:30-2:30pm)

  • Jianting Zhang and Le Gruenwald, "Parallel Selectivity Estimation for Optimizing Multidimensional Spatial Join Processing on GPUs"
  • Marcus Pinnecke, David Broneske, Gabriel Campero Durand and Gunter Saake, "Are Databases Fit for Hybrid Workloads on GPUs? A Storage Engine’s Perspective"

Session 4: Active Invited Indsutry Talks (2:30-3:30pm)

  • Roger Moussalli, "Tradeoffs and Considerations in the Design of Accelerators for Database Applications" [Abstract]
  • Eva Sitaridi, "Hardware Acceleration of Database Analytics" [Abstract, Slides]

Break (3:30-4:00pm)


Session 5: Joint Active/HardBD Panel - Next Big Problems (4:00-5:00pm) [Slides]


Closing Remarks (5:00-5:15pm)

Organization

Workshop Co-Chairs

  • Mohammad Sadoghi (Purdue University)
  • Rajesh R. Bordawekar (IBM T.J. Watson Research Center)

Publicity Chair

  • Kaiwen Zhang (Technical University of Munich)

Program Committee

  • Nipun Agarwal (Oracle)
  • Spyros Blanas (Ohio State University)
  • Khuzaima Daudjee (University of Waterloo)
  • Peter M. Fischer (University of Freiburg)
  • Blake G. Fitch (IBM Research, Zurich)
  • Boris Glavic (Illinois Institute of Technology)
  • Hans-Arno Jacobsen (Middleware Systems Research Group)
  • Kajan Kanagaratnam (IBM, Toronto)
  • Tirthankar Lahiri (Oracle)
  • Mohammadreza Najafi (Technical University of Munich)
  • Ilia Petrov (TU Darmstadt)
  • Tilmann Rabl (TU Berlin)
  • Tiark Rompf (Purdue University)
  • Mohamed Sarwat (Arizona State University)
  • Divesh Srivastava (AT&T Labs Research)
  • Dina Thomas (Pure Storage)
  • Stratis Viglas (University of Edinburgh)

Important Dates

  • Paper submissions (Extended): December 23, 2016
  • Notification to authors: January 20, 2017
  • Camera-ready copy due: February 7, 2017
  • Workshops: April 22, 2017

Submission Instructions

All submissions must be prepared according to the ICDE formatting guidelines. All accepted papers will be published in the ICDE proceedings and will also become publicly available through the IEEE Xplore. Submitted papers can be of three kinds (all accepted papers will be presented at the workshop):

  • Regular Research Papers: These papers should report original research results or significant case studies. They should be at most 8 pages.
  • Position Papers: These papers should report novel research directions or identify challenging problems. They should be at most 4 pages.
  • Abstracts: These papers should report on preliminary and ongoing work or exploratory research directions. They should be at most 1 page.

Papers have to be submitted electronically as PDF files via EasyChair.