Impala Data Insertion Methods Explained

Date: 2015-11-20 02:09:18

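Impala is a database that computes in memory; its official site claims query performance up to 100 times faster than Hive. Data can be inserted into an Impala table in the following ways: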

1. insert into

[slave12:21000] > insert into parquet_snappy select * from raw_text_data;
Inserted 1000000000 rows in 181.98s

2. CTAS (create table as select)

[slave12:21000] > create table test_table STORED AS PARQUET as select * from table;
Query: create table test_table STORED AS PARQUET as select * from table
+-------------------------+
| summary                 |
+-------------------------+
| Inserted 80000 row(s)   |
+-------------------------+

3. load data

[slave12:21000] > load data inpath '/user/hive/warehouse/test.db/table' into table test_table;
Query: load data inpath '/user/hive/warehouse/test.db/table' into table test_table
+----------------------------------------------------------+
| summary                                                  |
+----------------------------------------------------------+
| Loaded 1 file(s). Total files in destination location: 1 |
+----------------------------------------------------------+

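Note that this method can only load files that are already on HDFS. Loading local files is not supported: unlike Hive, Impala has no LOCAL keyword for this statement. Also, load data moves the files rather than copying them, so after the load the original table (the one whose directory was used as the inpath) needs a refresh; otherwise queries against it will report errors.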
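
The refresh step in the note above could look roughly like the following. This is a minimal sketch, not taken from the original post: source_table is a hypothetical name standing in for whichever table owned the directory passed as the inpath.

```sql
-- Move the files from the HDFS directory into test_table.
-- The path and target table are the ones used in the example above.
load data inpath '/user/hive/warehouse/test.db/table' into table test_table;

-- source_table is a hypothetical name for the table whose directory was the inpath.
-- Its files are gone now, so ask Impala to reload its file metadata.
refresh source_table;
```

For a file that only exists on the local disk, one option is to copy it to HDFS first (for example with hdfs dfs -put) and then run load data against the HDFS path.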

Source: http://daizj.iteye.com/blog/2257814