Advertisement
Not a member of Pastebin yet?
Sign Up,
it unlocks many cool features!
- from pyspark import SparkContext
- from pyspark.sql import SparkSession
- sc = SparkContext.getOrCreate()
- spark = SparkSession(sc)
- spark.read.load( "users.csv", format="csv", sep="|" ) \
- .toDF( "id","age","gender","occupation","zip" ) \
- .groupby( "gender" ) \
- .count().show()
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement