This is an introductory level talk about Apache Flink: a multi-purpose Big Data analytics framework leading a movement towards the unification of batch and stream processing or stream processing-first in the open source. With the many technical innovations it brings along with its unique vision and philosophy, it is considered the 4 G (4th Generation) of Big Data Analytics frameworks providing the only hybrid (Real-Time Streaming + Batch) open source distributed data processing engine supporting many use cases.
In this talk, you will learn more about:
1. What is Apache Flink stack? Its streaming dataflow execution engine, APIs and domain-specific libraries for batch, streaming, machine learning and graph processing.
2. How Apache Flink integrates with Hadoop and other open source tools for data input and output as well as deployment?
3. Why Apache Flink is an alternative to Apache Hadoop MapReduce, Apache Storm and Apache Spark?
4. How Apache Flink is used at Capital One?
PS: Please, visit https://hadoopsummit.uservoice.com/forums/332079-the-future-of-apache-hadoop/suggestions/10848465-overview-of-apache-flink-the-4g-of-big-data-analy, click on 'Vote', enter your email. You'll be done in less than 30 seconds!