josephxsxn

Orc Stripe tests - File Merging

Jun 5th, 2017

Test file I built up over time from LOTS of small ORC files, merged with ALTER TABLE CONCATENATE

-rwxr-xr-x   3 worker hdfs 1596985161 2017-06-03 21:53 /apps/hive/warehouse/orc_merge_tests/000000_0
hive --orcfiledump /apps/hive/warehouse/orc_merge_tests/000000_0 

62458 stripes, most very small...
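
To put "very small" in numbers, here is a quick back-of-the-envelope sketch using only the file size and stripe count listed above. The 64 MB reference stripe size is an assumption for comparison (it is a commonly used ORC stripe size; check orc.stripe.size on your own cluster), and the variable names are just illustrative.

```python
# Figures taken from the HDFS listing and stripe count above.
FILE_SIZE_BYTES = 1_596_985_161   # size of 000000_0
STRIPE_COUNT = 62_458             # stripes reported for the file

# Assumption: 64 MB reference stripe size, used only for comparison.
REFERENCE_STRIPE_BYTES = 64 * 1024 * 1024

# Average bytes per stripe in the merged file.
avg_stripe_bytes = FILE_SIZE_BYTES / STRIPE_COUNT

# How many stripes the same file would need at the reference size (ceiling division).
stripes_at_reference = -(-FILE_SIZE_BYTES // REFERENCE_STRIPE_BYTES)

print(f"average stripe: {avg_stripe_bytes / 1024:.1f} KiB")
print(f"stripes at {REFERENCE_STRIPE_BYTES // 2**20} MiB each: {stripes_at_reference}")
```

So the average stripe is roughly 25 KB, and the same data would fit in a couple of dozen stripes at the reference size instead of tens of thousands.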
see dump here ->
see configs here ->

HIVE-fd7ba3db-21a3-41b7-8ff0-a1d0bd25a476

Defaults, Cold Session

select count(*), line from orc_merge_tests group by line;
(with hot session)

Still running after 40 minutes in Tez. Gave up on waiting and went to MR.

Ran with 6 mappers in MR.
Elapsed: 7hrs, 28mins, 2sec [IN MR]
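
For a rough sense of how slow that is, the elapsed time can be turned into an overall scan rate. This is only arithmetic on the numbers in these notes (file size, stripe count, elapsed time), not a measurement of where the time actually went; the variable names are illustrative.

```python
# Figures taken from the notes above.
FILE_SIZE_BYTES = 1_596_985_161   # size of 000000_0
STRIPE_COUNT = 62_458             # stripes reported for the file

# Elapsed MR time: 7hrs, 28mins, 2sec.
elapsed_s = 7 * 3600 + 28 * 60 + 2

# Overall rates for the whole job (all mappers combined).
bytes_per_s = FILE_SIZE_BYTES / elapsed_s
stripes_per_s = STRIPE_COUNT / elapsed_s

print(f"elapsed: {elapsed_s} s")
print(f"throughput: {bytes_per_s / 1024:.1f} KiB/s")
print(f"stripes processed: {stripes_per_s:.2f} per second")
```

That works out to only a few stripes per second across the whole job, which fits the idea that the stripe count, not the data volume, is what hurts here.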

INSERT OVERWRITE with a massive (28 GB) single reducer to get a big file
