Speaker: Volker Markl, Technische Universität Berlin
Title: Big Data looks tiny from the Stratosphere
The talk will describe Stratosphere, a new flavor of data processor that goes beyond the popular map/reduce programming paradigm. Stratosphere employs a more generic programming model based on second-order functions and has language frontends in Java, Scala, and a scripting language called Meteor. Key design points of Stratosphere are the automatic optimization, parallelization, and hardware adaptation of complex data analysis pipelines, from laptops to compute clusters, as well as native support for iterative data analysis programs in addition to relational operators and user-defined functions. Through the concepts of bulk and workset iterations, Stratosphere can process information extraction and integration operations together with deep analytics in a single system, subsuming many specialized systems for graph processing or machine learning. Stratosphere runs standalone, natively in compute clusters, or without special installation in any Hadoop cluster via YARN. Stratosphere is available as open source at www.stratosphere.eu. In addition to the technologies available in the current open source release of Stratosphere, the talk will also cover ongoing research, in particular work on optimistic, overhead-free fault tolerance.
Volker Markl is a Full Professor and Chair of the Database Systems and Information Management (DIMA) group at the Technische Universität Berlin (TU Berlin) as well as an adjunct status-only professor at the University of Toronto. Earlier in his career, Dr. Markl led a research group at FORWISS, the Bavarian Research Center for Knowledge-based Systems in Munich, Germany, and was a Research Staff Member and Project Leader at the IBM Almaden Research Center in San Jose, California, USA. Dr. Markl has published numerous research papers on indexing, query optimization, lightweight information integration, and scalable data processing. He holds 6 patents, has transferred technology into several commercial products, and advises several companies and startups in the Big Data space. He currently serves as the secretary of the VLDB Endowment.