Meetup + Slides: Apache Geode and Apache Apex Integration
The video contains three talks centered around integration of Apache Geode and Apache Apex after the introduction.
Apache Geode provides a database-like consistency model, reliable transaction processing and a shared-nothing architecture to maintain very low latency performance with high concurrency processing.
Apache Apex, http://apex.incubator.apache.org/, is an open source stream processing and next generation analytics platform incubating at the Apache Software Foundation. Apex is Hadoop native and was built from ground up for scalability, low-latency processing, high availability and operability.
The talks are
1. Pivotal's effort on Apache Geode
Nitin discusses rationale behind Apache Geode and walk through the leadership role played by Pivotal in OSS efforts of Apache Geode.
Speaker: Nitin Lamba leads product management at Ampool, a company he co-founded. Prior to Ampool, he worked at a robotics company, which builds ocean drones using a real-time Java platform.
2. Apex & Geode In-memory computation, storage & analysis
Apache Apex & Apache Geode are two very promising incubating open source projects, combined they promise to fill gaps of existing big data analytics platforms.
Apache Geode provides a database-like consistency model, reliable transaction processing and a shared-nothing architecture to maintain very low latency performance with high concurrency processing.
In this session we will talk about use cases and on-going efforts of integrating Apex and Geode to build scallable & fault tolerant RealTime streaming applications that ingest from various sources and egress to Geode.
Use case 1 - Geode as data store to write streaming processed data computed by Apex which is powering user applications or dashboards.
Use case 2 - Apex application reading data from Geode cache and use it for data processing.
Use case 3 - Apex platform's operator checkpointing in Geode to improve performance of Apex batch operations.
Speaker: Ashish Tadose is a technical lead at Ampool, and worked at PubMatic, as a Lead Engineer, Big Data & Analytics, where he led a team driving large scale data ingestion and real-time streaming analytics solutions.
3. AdTech Pipeline: Kafka to Apex to Geode
Demonstration of a common big data AdTech data pipeline using Kafka, Apex, and Geode. The data source is Kafka, and Geode is used to store results of computations. Data will be ingested into Hadoop using Kafka input connector from Apache Malhar. Computations and transformations will then be performed on this data by an Apache Apex application that runs natively in Hadoop. The results are then loaded into Geode for UI queries.
Speaker: Vitthal Gogate is a Hadoop veteran who has worked on various Hadoop components. His work experience includes Senior Research staff engineer at IBM; Solutions Architect role in Yahoo! Hadoop; Chief Architect and Product Manager of Pivotal Hadoop Distribution; Architect for Hadoop Installation, Management & Monitoring product at Hortonworks.
Other Videos By DataTorrent
Other Statistics
Apache Statistics For DataTorrent
There are 242 views in 1 video for Apache. There's close to 2 hours worth of content for Apache published on his channel, roughly 1.46% of the content that DataTorrent has uploaded to YouTube.