Real-Time Streaming Data Enrichment with Database CDC | 2/5

Learn how to keep reference data up to date while enriching your data streams with Apache Flink. We’ll take an in-depth look at how an Apache Flink streaming join combines real-time event data with row-level database changes captured through Change Data Capture (CDC).
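
To make the idea concrete, here is a minimal plain-Python sketch of CDC-driven enrichment. It is not Flink code and not the code from the video: the `ChangeEvent` shape, the `"c"`/`"u"`/`"d"` operation codes, and the `player_id`/`country` fields are assumptions chosen for illustration. It only shows the principle of keeping keyed reference state in sync with row-level changes and joining each incoming event against the latest row. A real Flink join may also update results it has already emitted, which this sketch does not attempt.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class ChangeEvent:
    """One row-level change captured from the database (hypothetical shape)."""
    op: str               # assumed codes: "c" = create, "u" = update, "d" = delete
    key: str              # primary key of the changed row
    row: Optional[dict]   # new row image; None for deletes


class ReferenceState:
    """Keeps the latest version of each reference row, keyed by primary key."""

    def __init__(self) -> None:
        self._rows: Dict[str, dict] = {}

    def apply(self, change: ChangeEvent) -> None:
        """Apply a single change so the state mirrors the source table."""
        if change.op in ("c", "u"):
            self._rows[change.key] = dict(change.row or {})
        elif change.op == "d":
            self._rows.pop(change.key, None)
        else:
            raise ValueError(f"unknown op: {change.op!r}")

    def enrich(self, event: dict, key_field: str) -> dict:
        """Join one stream event with the current reference row, if any."""
        ref = self._rows.get(event[key_field], {})
        return {**ref, **event}  # event fields win on a name clash


def run_demo() -> List[dict]:
    """Interleave changes and events to show enrichment following the table."""
    state = ReferenceState()
    out = []
    state.apply(ChangeEvent("c", "p1", {"player_id": "p1", "country": "DE"}))
    out.append(state.enrich({"player_id": "p1", "score": 10}, "player_id"))
    state.apply(ChangeEvent("u", "p1", {"player_id": "p1", "country": "FR"}))
    out.append(state.enrich({"player_id": "p1", "score": 25}, "player_id"))
    state.apply(ChangeEvent("d", "p1", None))
    out.append(state.enrich({"player_id": "p1", "score": 40}, "player_id"))
    return out
```

Calling `run_demo()` returns three enriched events: the first carries the original country, the second carries the updated one, and the third has no reference fields because the row was deleted before the event arrived.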

In this series, Anand Shah (Data Analytics and Streaming Specialist at AWS) will help you build a modern data streaming architecture for a real-time gaming leaderboard. The architecture covers data ingestion, real-time enrichment with database change data capture (CDC), data processing, and computing, storing, and visualizing the results. You will also learn advanced streaming analytics techniques, such as the control channel method for A/B testing, updating features and parameters with zero downtime, and handling late-arriving data. Anand will also walk you through data de-duplication and show how to store historical data for on-demand replay. 🎉

🌟 Get started with Amazon Managed Service for Apache Flink today to build and run fully managed Apache Flink applications on AWS! 👉 https://aws.amazon.com/managed-servic...

🔗 Github repository: https://github.com/build-on-aws/real-...

Resources used in this video:
🔗 AWS CDK Overview: https://docs.aws.amazon.com/cdk/v2/gu...
🔗 Apache Flink CDC Connectors: https://github.com/apache/flink-cdc
🔗 Apache Flink Joins: https://nightlies.apache.org/flink/fl...
🔗 Modern Streaming Data Architecture on AWS: https://docs.aws.amazon.com/whitepape...

Follow AWS Developers:
👾 Twitch:   / aws  
🐦 Twitter:   / awsdevelopers  
💻 LinkedIn:   / aws  

Follow Anand Shah:
🐦 Twitter:   / anandshah110  
💻 LinkedIn:   / anandshah110  

00:00 Intro
00:35 What will you learn?
01:28 What is Change Data Capture (CDC)?
02:33 Keeping Apache Flink state up-to-date
03:20 Demo: CDK source code walkthrough and deploy
06:56 Demo: Building the CDC connector and using Managed Flink Notebooks
09:16 Demo: Challenge 2 - Querying player demographics and CDC join
10:15 Conclusion

#FlinkCDC, #ManagedServiceForApacheFlink, #StateManagement
