- Explain
- STAGE DEPENDENCIES:
- Stage-1 is a root stage
- Stage-0 depends on stages: Stage-1
- Stage-2 depends on stages: Stage-0
- STAGE PLANS:
- Stage: Stage-1
- Map Reduce
- Map Operator Tree:
- TableScan
- alias: test_tez_input
- filterExpr: ((to_date(start_date_time) <= 2016-11-20) and ((to_date(end_date_time) > 2016-11-20) or end_date_time is null)) (type: boolean)
- Statistics: Num rows: 3925863392 Data size: 514735215147 Basic stats: COMPLETE Column stats: NONE
- Filter Operator
- predicate: ((to_date(start_date_time) <= 2016-11-20) and ((to_date(end_date_time) > 2016-11-20) or end_date_time is null)) (type: boolean)
- Statistics: Num rows: 1090517608 Data size: 142982004090 Basic stats: COMPLETE Column stats: NONE
- Select Operator
- expressions: l_id (type: string), psla (type: int), fsp (type: double), lis (type: boolean), aver (type: string), asub (type: string), iid (type: string)
- outputColumnNames: _col0, _col1, _col2, _col3, _col4, _col5, _col6
- Statistics: Num rows: 1090517608 Data size: 142982004090 Basic stats: COMPLETE Column stats: NONE
- Union
- Statistics: Num rows: 4362070432 Data size: 571928016360 Basic stats: COMPLETE Column stats: NONE
- Group By Operator
- aggregations: count(DISTINCT _col0), count(DISTINCT _col6)
- keys: _col1 (type: int), _col2 (type: double), _col3 (type: boolean), _col4 (type: string), _col5 (type: string), _col0 (type: string), _col6 (type: string)
- mode: hash
- outputColumnNames: _col0, _col1, _col2, _col3, _col4, _col5, _col6, _col7, _col8
- Statistics: Num rows: 4362070432 Data size: 571928016360 Basic stats: COMPLETE Column stats: NONE
- Reduce Output Operator
- key expressions: _col0 (type: int), _col1 (type: double), _col2 (type: boolean), _col3 (type: string), _col4 (type: string), _col5 (type: string), _col6 (type: string)
- sort order: +++++++
- Map-reduce partition columns: _col0 (type: int), _col1 (type: double), _col2 (type: boolean), _col3 (type: string), _col4 (type: string)
- Statistics: Num rows: 4362070432 Data size: 571928016360 Basic stats: COMPLETE Column stats: NONE
- TableScan
- alias: test_tez_input
- filterExpr: ((to_date(start_date_time) <= 2016-11-20) and ((to_date(end_date_time) > 2016-11-20) or end_date_time is null)) (type: boolean)
- Statistics: Num rows: 3925863392 Data size: 514735215147 Basic stats: COMPLETE Column stats: NONE
- Filter Operator
- predicate: ((to_date(start_date_time) <= 2016-11-20) and ((to_date(end_date_time) > 2016-11-20) or end_date_time is null)) (type: boolean)
- Statistics: Num rows: 1090517608 Data size: 142982004090 Basic stats: COMPLETE Column stats: NONE
- Select Operator
- expressions: l_id (type: string), psla (type: int), fsp (type: double), lis (type: boolean), aver (type: string), asub (type: string), iid (type: string)
- outputColumnNames: _col0, _col1, _col2, _col3, _col4, _col5, _col6
- Statistics: Num rows: 1090517608 Data size: 142982004090 Basic stats: COMPLETE Column stats: NONE
- Union
- Statistics: Num rows: 4362070432 Data size: 571928016360 Basic stats: COMPLETE Column stats: NONE
- Group By Operator
- aggregations: count(DISTINCT _col0), count(DISTINCT _col6)
- keys: _col1 (type: int), _col2 (type: double), _col3 (type: boolean), _col4 (type: string), _col5 (type: string), _col0 (type: string), _col6 (type: string)
- mode: hash
- outputColumnNames: _col0, _col1, _col2, _col3, _col4, _col5, _col6, _col7, _col8
- Statistics: Num rows: 4362070432 Data size: 571928016360 Basic stats: COMPLETE Column stats: NONE
- Reduce Output Operator
- key expressions: _col0 (type: int), _col1 (type: double), _col2 (type: boolean), _col3 (type: string), _col4 (type: string), _col5 (type: string), _col6 (type: string)
- sort order: +++++++
- Map-reduce partition columns: _col0 (type: int), _col1 (type: double), _col2 (type: boolean), _col3 (type: string), _col4 (type: string)
- Statistics: Num rows: 4362070432 Data size: 571928016360 Basic stats: COMPLETE Column stats: NONE
- TableScan
- alias: test_tez_input
- filterExpr: ((to_date(start_date_time) <= 2016-11-20) and ((to_date(end_date_time) > 2016-11-20) or end_date_time is null)) (type: boolean)
- Statistics: Num rows: 3925863392 Data size: 514735215147 Basic stats: COMPLETE Column stats: NONE
- Filter Operator
- predicate: ((to_date(start_date_time) <= 2016-11-20) and ((to_date(end_date_time) > 2016-11-20) or end_date_time is null)) (type: boolean)
- Statistics: Num rows: 1090517608 Data size: 142982004090 Basic stats: COMPLETE Column stats: NONE
- Select Operator
- expressions: l_id (type: string), psla (type: int), fsp (type: double), lis (type: boolean), aver (type: string), asub (type: string), iid (type: string)
- outputColumnNames: _col0, _col1, _col2, _col3, _col4, _col5, _col6
- Statistics: Num rows: 1090517608 Data size: 142982004090 Basic stats: COMPLETE Column stats: NONE
- Union
- Statistics: Num rows: 4362070432 Data size: 571928016360 Basic stats: COMPLETE Column stats: NONE
- Group By Operator
- aggregations: count(DISTINCT _col0), count(DISTINCT _col6)
- keys: _col1 (type: int), _col2 (type: double), _col3 (type: boolean), _col4 (type: string), _col5 (type: string), _col0 (type: string), _col6 (type: string)
- mode: hash
- outputColumnNames: _col0, _col1, _col2, _col3, _col4, _col5, _col6, _col7, _col8
- Statistics: Num rows: 4362070432 Data size: 571928016360 Basic stats: COMPLETE Column stats: NONE
- Reduce Output Operator
- key expressions: _col0 (type: int), _col1 (type: double), _col2 (type: boolean), _col3 (type: string), _col4 (type: string), _col5 (type: string), _col6 (type: string)
- sort order: +++++++
- Map-reduce partition columns: _col0 (type: int), _col1 (type: double), _col2 (type: boolean), _col3 (type: string), _col4 (type: string)
- Statistics: Num rows: 4362070432 Data size: 571928016360 Basic stats: COMPLETE Column stats: NONE
- TableScan
- alias: test_tez_input
- filterExpr: ((to_date(start_date_time) <= 2016-11-20) and ((to_date(end_date_time) > 2016-11-20) or end_date_time is null)) (type: boolean)
- Statistics: Num rows: 3925863392 Data size: 514735215147 Basic stats: COMPLETE Column stats: NONE
- Filter Operator
- predicate: ((to_date(start_date_time) <= 2016-11-20) and ((to_date(end_date_time) > 2016-11-20) or end_date_time is null)) (type: boolean)
- Statistics: Num rows: 1090517608 Data size: 142982004090 Basic stats: COMPLETE Column stats: NONE
- Select Operator
- expressions: l_id (type: string), psla (type: int), fsp (type: double), lis (type: boolean), aver (type: string), asub (type: string), iid (type: string)
- outputColumnNames: _col0, _col1, _col2, _col3, _col4, _col5, _col6
- Statistics: Num rows: 1090517608 Data size: 142982004090 Basic stats: COMPLETE Column stats: NONE
- Union
- Statistics: Num rows: 4362070432 Data size: 571928016360 Basic stats: COMPLETE Column stats: NONE
- Group By Operator
- aggregations: count(DISTINCT _col0), count(DISTINCT _col6)
- keys: _col1 (type: int), _col2 (type: double), _col3 (type: boolean), _col4 (type: string), _col5 (type: string), _col0 (type: string), _col6 (type: string)
- mode: hash
- outputColumnNames: _col0, _col1, _col2, _col3, _col4, _col5, _col6, _col7, _col8
- Statistics: Num rows: 4362070432 Data size: 571928016360 Basic stats: COMPLETE Column stats: NONE
- Reduce Output Operator
- key expressions: _col0 (type: int), _col1 (type: double), _col2 (type: boolean), _col3 (type: string), _col4 (type: string), _col5 (type: string), _col6 (type: string)
- sort order: +++++++
- Map-reduce partition columns: _col0 (type: int), _col1 (type: double), _col2 (type: boolean), _col3 (type: string), _col4 (type: string)
- Statistics: Num rows: 4362070432 Data size: 571928016360 Basic stats: COMPLETE Column stats: NONE
- Reduce Operator Tree:
- Group By Operator
- aggregations: count(DISTINCT KEY._col5:0._col0), count(DISTINCT KEY._col5:1._col0)
- keys: KEY._col0 (type: int), KEY._col1 (type: double), KEY._col2 (type: boolean), KEY._col3 (type: string), KEY._col4 (type: string)
- mode: mergepartial
- outputColumnNames: _col0, _col1, _col2, _col3, _col4, _col5, _col6
- Statistics: Num rows: 2181035216 Data size: 285964008180 Basic stats: COMPLETE Column stats: NONE
- Select Operator
- expressions: _col3 (type: string), _col4 (type: string), '2016-11-20 00:00:00' (type: string), _col5 (type: bigint), CASE WHEN (((_col0 >= 0) and (_col1 > 0.0) and (_col2 = '1'))) THEN (_col5) ELSE (0) END (type: bigint), _col6 (type: bigint), CASE WHEN (((_col0 >= 0) and (_col1 > 0.0) and (_col2 = '1'))) THEN (_col6) ELSE (0) END (type: bigint)
- outputColumnNames: _col0, _col1, _col2, _col3, _col4, _col5, _col6
- Statistics: Num rows: 2181035216 Data size: 285964008180 Basic stats: COMPLETE Column stats: NONE
- File Output Operator
- compressed: false
- Statistics: Num rows: 2181035216 Data size: 285964008180 Basic stats: COMPLETE Column stats: NONE
- table:
- input format: org.apache.hadoop.mapred.TextInputFormat
- output format: org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat
- serde: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe
- name: test.test_tez_multiunion
- Stage: Stage-0
- Move Operator
- tables:
- replace: true
- table:
- input format: org.apache.hadoop.mapred.TextInputFormat
- output format: org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat
- serde: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe
- name: test.test_tez_multiunion
- Stage: Stage-2
- Stats-Aggr Operator
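
A quick sanity check on the Statistics lines above, as a minimal sketch: the numbers are copied verbatim from the plan, while the constant and function names are illustrative only and not part of Hive. It checks that the Union estimate equals the sum of the four filtered branches, and that the reduce-side estimate in this plan is exactly half of the Union estimate. It does not attempt to explain how Hive derives these estimates.

```python
# Row-count and data-size estimates copied from the Statistics lines of the plan.
SCAN_ROWS = 3925863392       # TableScan on test_tez_input
FILTER_ROWS = 1090517608     # after the Filter Operator (per branch)
FILTER_BYTES = 142982004090
UNION_ROWS = 4362070432      # Union of all branches
UNION_BYTES = 571928016360
REDUCE_ROWS = 2181035216     # Group By (mergepartial) on the reduce side
REDUCE_BYTES = 285964008180

BRANCHES = 4  # number of TableScan branches feeding the Union in Stage-1


def check_plan_statistics():
    """Return a dict of named consistency checks over the plan's estimates."""
    return {
        "union_rows_is_sum_of_branches": BRANCHES * FILTER_ROWS == UNION_ROWS,
        "union_bytes_is_sum_of_branches": BRANCHES * FILTER_BYTES == UNION_BYTES,
        "reduce_rows_is_half_of_union": 2 * REDUCE_ROWS == UNION_ROWS,
        "reduce_bytes_is_half_of_union": 2 * REDUCE_BYTES == UNION_BYTES,
    }


def filter_selectivity():
    """Fraction of scanned rows the optimizer expects to pass the filter."""
    return FILTER_ROWS / SCAN_ROWS


if __name__ == "__main__":
    for name, ok in check_plan_statistics().items():
        print(f"{name}: {ok}")
    print(f"estimated filter selectivity: {filter_selectivity():.4f}")
```

Since the plan reports `Column stats: NONE`, these figures are optimizer estimates rather than measured row counts, so the check only confirms that the plan is internally consistent.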