[docs]:Add Flink CDC integration doc under Flink #3347
- A running **Fluss cluster** (CoordinatorServer + TabletServer). See [Deploying with Docker](../install-deploy/deploying-with-docker.md) for setup instructions.
- A running **Flink cluster** with the required connector JARs. See [Getting Started with Flink](getting-started.md) for Flink setup.
- The required connector JARs placed under `<FLINK_HOME>/lib/`. The examples below use MySQL as the source, but other databases (PostgreSQL, Oracle, etc.) are also supported; see [Further Reading](#further-reading) for the full list of connectors.
Would it make sense to use PostgreSQL as the source example instead of MySQL? The original issue (#1939) describes the use case with PostgreSQL, and it's more widely adopted internationally, which might resonate better with readers.
Submit the pipeline using the Flink CDC CLI:

```shell
./bin/flink-cdc.sh mysql-to-fluss.yaml
```
./bin/flink-cdc.sh comes from the standalone Flink CDC distribution (flink-cdc-3.x.y-bin.tar.gz), not the standard Flink distribution. It might be helpful to mention in the Prerequisites section that users need to download the Flink CDC distribution separately and set FLINK_HOME.
I will change the source to PostgreSQL.
---
sidebar_label: Flink CDC
title: Flink CDC Integration
sidebar_position: 9
sidebar_position: 9 conflicts with the existing options.md in the same directory. Would you mind changing it to 10 to avoid non-deterministic sidebar ordering?
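As a sketch of the suggested change, the front matter could read as follows. Only the `sidebar_position` value differs from the quoted diff; the closing `---` delimiter is an assumption, since the quoted hunk ends before it.

```yaml
---
sidebar_label: Flink CDC
title: Flink CDC Integration
sidebar_position: 10  # was 9, which collides with options.md in the same directory
---
```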
@beryllw PTAL
Purpose
Linked issue: close #1945
Brief change log
- Added a new Flink CDC integration page under "Flink Engine" at website/docs/engine-flink/flink-cdc-intergartion.md.
- Introduced two integration methods:
  - Flink CDC Pipeline Connector: defines a synchronization pipeline via a YAML file.
  - Flink CDC SQL Connector: syncs PostgreSQL data using SQL statements.
- Added a reference link to the official Flink CDC documentation.
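For reviewers unfamiliar with the pipeline approach, a definition file of the kind the Pipeline Connector method describes might look roughly like the sketch below. It is not copied from this PR: every hostname, port, credential, table name, and the pipeline name is a placeholder assumption, and the option keys should be checked against the Flink CDC release in use.

```yaml
# mysql-to-fluss.yaml (file name taken from the CLI example in the doc under review)
source:
  type: mysql
  hostname: localhost        # assumption: MySQL reachable locally
  port: 3306                 # assumption: default MySQL port
  username: flink_user       # placeholder credential
  password: change-me        # placeholder credential
  tables: app_db.orders      # hypothetical database.table to capture

sink:
  type: fluss
  bootstrap.servers: localhost:9123   # assumption: local Fluss CoordinatorServer address

pipeline:
  name: Sync MySQL to Fluss  # hypothetical job name
  parallelism: 1
```

If the example source is switched to PostgreSQL as discussed in the review, the `source` block would change accordingly while the `sink` and `pipeline` blocks would stay the same in this sketch.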
Tests
API and Format
Documentation