MySQL to StarRocks Example

Here we use an example to introduce how to use Apache InLong creating MySQL -> StarRocks data synchronization.

Deployment

Install InLong

Before we begin, we need to install InLong. Here we provide two ways:

Add Connectors

Download the connectors corresponding to Flink 1.13, and after decompression, place sort-connector-starrocks-[version]-SNAPSHOT.jar in /inlong-sort/connectors/ directory.

Install StarRocks

Please refer to the Installation Tutorial on the Apache StarRocks official website

Cluster Initialize

When all containers are successfully started, you can access the InLong dashboard address http://localhost, and use the following default account to log in.

  1. User: admin
  2. Password: inlong

Create Cluster Tag

Click [Clusters] -> [ClusterTags] -> [Create] on the page to specify the cluster label name and responsible person. Create Cluster Tag

MySQL to StarRocks Example - 图2caution

default_cluster is the default ClusterTags reported by each component. If you decide to use a different name, make sure to update the corresponding tag configuration accordingly.

Register Pulsar Cluster

Click [Clusters] -> [Cluster] -> [Create] on the page to register Pulsar Cluster. Create Pulsar Cluster

MySQL to StarRocks Example - 图4note

The ClusterTags selects the newly created default_cluster, the Pulsar cluster deployed by docker:

Service URL is pulsar://pulsar:6650, Admin URL is http://pulsar:8080.

Register StarRocks DataNodes

Click [DataNodes] -> [Create] on the page to register StarRocks DataNodes. Create StarRocks DataNode

MySQL to StarRocks Example - 图6note

  • Please do not fill in http:// for LOAD URL, just fill in IP:PORT.

Create Task

Create Data Streams Group

Click [Synchronization] → [Create] on the page and input the Group ID, Stream ID and Full database migration: Create Group Stream

Create Data Source

In the data source, click [New] → [MySQL] to configure the source name, address, databases and tables information. Create Stream_Source

MySQL to StarRocks Example - 图9note

  • When the read mode is selected as Full + Incremental, the existing data in the table will also be collected, but not in the Incremental mode.
  • The table white list format is <dbName>.<tableName> and supports regular expressions.

Create Data Sink

In the data sink, click [New] → [StarRocks] to configure the sink name, database name, table name and created StarRocks data node.

Create data object

Approve Data Stream

Click [Approval] -> [MyApproval] -> [Approval] -> [Ok].

Approve

Back to [Synchronization] page, wait for [success].

Success

Test Data

Send Data

  1. #!/bin/bash
  2. # MySQL info
  3. DB_HOST="mysql"
  4. DB_USER="root"
  5. DB_PASS="inlong"
  6. DB_NAME="test"
  7. DB_TABLE="source_table"
  8. # Insert data in a loop
  9. for ((i=1; i<=1000; i++))
  10. do
  11. # Generate data
  12. id=$i
  13. name="name_$i"
  14. # Build an insert SQL
  15. query="INSERT INTO $DB_TABLE (id, name) VALUES ($id, '$name');"
  16. # Execute insert SQL
  17. mysql -h $DB_HOST -u $DB_USER -p$DB_PASS $DB_NAME -e "$query"
  18. done

Modify the variables in the script according to the actual environment, and add a total of 1000 pieces of data to the source_table:

Result Source

Verify Data

Enter StarRocks, check data in sink_table.

Result Sink

You can also view audit data on the page:

Result Sink