A presentation at DataEngBytes by Marta Paes
Change Data Capture (CDC) has become the standard to capture and propagate committed changes from a database to downstream consumers, for example to keep multiple datastores in sync and avoid common pitfalls such as dual writes (remember? “Friends don’t let friends do dual writes”).
Consuming these changelogs with Apache Flink used to be a pain, but the latest release (Flink 1.11) introduced not only support for CDC, but support for CDC from the comfort of your SQL couch. In this talk, we’ll demo how to use Flink SQL to easily process database changelog data generated with Debezium.
The following resources were mentioned during the presentation or are useful additional information.
Try out the demo used in the presentation!
A great talk from Gunnar Morling on the basics of CDC (with Debezium).
A hands-on training to get you started from scratch.
More on CDC from around the web.
Here’s what was said about this presentation on social media.