Advertisement
Guest User

Untitled

a guest
Aug 9th, 2016
199
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
text 0.73 KB | None | 0 0
  1. val opts = Map(
  2. "url" -> s"jdbc:postgresql://$DB_HOST:$DB_PORT/$DATABASE",
  3. "driver" -> "org.postgresql.Driver",
  4. "dbtable" -> DB_TABLE,
  5. "user" -> DB_USER,
  6. "password"-> DB_PASSWORD,
  7. "partitionColumn" -> "id",
  8. "lowerBound" -> "1",
  9. "upperBound" -> "96509080",
  10. "numPartitions" -> "10000"
  11. )
  12.  
  13. val reportsDf = sparkSession.read.format("jdbc").options(opts).load
  14.  
  15. reportsDf.createOrReplaceTempView("custom_reports")
  16.  
  17. val reportId = reportsDf.select("fileId").distinct.as[String].collect()
  18.  
  19. reportId.repartition(100).cache()
  20.  
  21. ....
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement