Data Loading Overview
SynxDB loads external data by converting it into external tables via loading tools. Then, it reads from or writes to these external tables to complete the external data loading process.
Data loading process
The general process of loading data into SynxDB is as follows:
Evaluate the data loading scenario (such as data source location, data types, and data volume) and choose the appropriate loading tool.
Configure and activate the loading tool.
Create an external table, specifying the loading tool protocol, data source address, and data format in the
CREATE EXTERNAL TABLE
statement. For details, see Load External Data Using Foreign Table.Once the external table is created, you can directly query the data using the
SELECT
statement or load the data into a table using theINSERT INTO SELECT
statement.
Loading tools and scenarios
SynxDB provides multiple data loading solutions, allowing you to choose different methods based on the data source.
Loading method/tool |
Data source |
Data format |
Parallel loading |
---|---|---|---|
|
|
No |
|
Local file system (local Segment nodes, only accessible by superusers) |
|
Yes |
|
Local host files or files accessible over the intranet |
|
Yes |
|
Bulk loading using gpload (using |
Local host files or files accessible over the intranet |
|
Yes |
Data fetched from web services or any source accessible via command line |
|
Yes |
|
Kafka FDW and Kafka Connector <load-data/load-data-using-kafka-connector> |
Kafka |
|
Yes |
|
|
Yes |
|
Hive Connector with |
Hive data warehouse |
|
Yes |
|
Depends on the connector. Common formats include: - Text, CSV, JSON - Avro, Parquet, ORC |
Yes |
|
MySQL |
MySQL table data |
Yes |