2 个回答
Doris支持多种数据导出方式,例如:
1、使用export命令导出,适用于TB级数据高效导出到HDFS、S3、OSS等分布式存储。
1、使用export命令导出,适用于TB级数据高效导出到HDFS、S3、OSS等分布式存储。
EXPORT TABLE db1.orders
PARTITION (p202301, p202302)
TO "hdfs://hadoop-cluster/export/orders/"
WITH BROKER "broker1"
PROPERTIES (
"format" = "csv", -- 支持csv/parquet
"column_separator" = ","
);
2、适用select into outfile导出,适用于中小规模数据(GB级)导出到本地或远程文件系统。SELECT * FROM orders
WHERE dt = '2023-11-01'
INTO OUTFILE "file:///data/export/orders.csv"
FORMAT AS CSV
PROPERTIES (
"max_file_size" = "1GB"
);
发布于:2周前 (05-21) IP属地:
3、JDBC程序导出,适用于将数据导入到其他数据库(如MySQL、PostgreSQL)或自定义系统。
4、直接将数据写入外部存储
4、直接将数据写入外部存储
#创建外部表
CREATE CATALOG hive PROPERTIES (
"type" = "hms",
"hive.metastore.uris" = "thrift://hms-host:9083"
);
#导出数据
INSERT INTO hive.analytics.orders_all
SELECT * FROM doris_db.orders;
5、备份恢复间接导出# 1. 创建备份
BACKUP SNAPSHOT db1.backup_20231101
TO "s3://bucket/backup/"
ON db1.orders
PROPERTIES ("type" = "full");
# 2. 恢复至新集群(实现导出)
RESTORE SNAPSHOT db1.backup_20231101
FROM "s3://bucket/backup/";
发布于:2周前 (05-21) IP属地:
我来回答
您需要 登录 后回答此问题!