QnaList > Groups > Sqoop-User > Nov 2015
faq

Is Hash-based Partition Supported By Sqoop?

Hi, guys,
I have a question how Sqoop imports the data in parallel. IMO, Sqoop first
gets the min and max values for the SPLIT_BY column, and then does a
range-based partition, to let each mapper consumes one range. Do we support
hash-based partition, like each mapper ingests the data satisfying query
"select * from table where hash(split_by) % n = i" ?
thanks,
Wei

asked Nov 5 2015 at 13:59

Wei Yan 's gravatar image



Related discussions

Tagged

Group Sqoop-user

asked Nov 5 2015 at 13:59

active Nov 5 2015 at 13:59

posts:1

users:1

©2013 QnaList.com