Not a member of Pastebin yet?
Sign Up,
it unlocks many cool features!
- l = range(1, 10000)
- lRDD = sc.parallelize(l)
- productsRaw = open("/data/retail_db/products/part-00000").read().splitlines()
- type(productsRaw)
- productsRDD = sc.parallelize(productsRaw)
- type(productsRDD)
- productsRDD.first()
- for i in productsRDD.take(10): print(i)
- productsRDD.count()
Add Comment
Please, Sign In to add comment