首页 > 编程语言 > 详细

Hive并行排序

时间:2014-11-14 02:17:37      阅读:692      评论:0      收藏:0      [点我收藏+]

set hive.optimize.sampling.orderby=true;
set hive.optimize.sampling.orderby.number=10000;
set hive.optimize.sampling.orderby.percent=0.1f;

?

?

记录一下,Hive中并行排序参数;

?

hive.optimize.sampling.orderby
??? Default Value: false
??? Added In: Hive 0.12.0 with HIVE-1402
Uses sampling on order-by clause for parallel execution.


hive.optimize.sampling.orderby.number
??? Default Value: 1000
??? Added In: Hive 0.12.0 with HIVE-1402
With hive.optimize.sampling.orderby=true, total number of samples to be obtained to calculate partition keys.


hive.optimize.sampling.orderby.percent
??? Default Value: 0.1
??? Added In: Hive 0.12.0 with HIVE-1402
With hive.optimize.sampling.orderby=true, probability with which a row will be chosen.

Hive并行排序

原文:http://superlxw1234.iteye.com/blog/2155436

(0)
(0)
   
举报
评论 一句话评论(0
关于我们 - 联系我们 - 留言反馈 - 联系我们:wmxa8@hotmail.com
© 2014 bubuko.com 版权所有
打开技术之扣,分享程序人生!