Advertisement
ANCHI22

data science - cheat sheets - spark

Jun 20th, 2019
134
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
Bash 1.43 KB | None | 0 0
  1. # Books and cheat sheet
  2. https://perso.limsi.fr/pointal/_media/python:cours:mementopython3-english.pdf
  3. https://s3.amazonaws.com/assets.datacamp.com/blog_assets/PySpark_SQL_Cheat_Sheet_Python.pdf
  4. https://jaxenter.com/cheat-sheet-complete-guide-scala-136558.html
  5. https://alvinalexander.com/downloads/scala/Scala-Cheat-Sheet-devdaily.pdf
  6. https://riptutorial.com/ebook/scala
  7. https://mbonaci.github.io/scala/
  8. https://www.artima.com/shop/cart
  9. https://riptutorial.com/Download/scala-language.pdf
  10. https://sinxloud.com/python-cheat-sheet-beginner-advanced/
  11. https://ehmatthes.github.io/pcc/cheatsheets/README.html
  12. https://www.datacamp.com/community/data-science-cheatsheets
  13. http://www.lirmm.fr/~ducour/Doc-objets/scalabook.pdf
  14.  
  15. # Install spark, scala, java on wsl or ubuntu
  16. https://www.knowledgehut.com/blog/big-data/install-spark-on-ubuntu
  17. # Install java jdk
  18. sudo apt install default-jdk
  19. # Check version
  20. java -version
  21.  
  22. # Install scala
  23. sudo apt install scala
  24. # Check version
  25. scala -version
  26.  
  27. # install spark, hadoop
  28. wget http://apache.dattatec.com/spark/spark-2.4.3/spark-2.4.3-bin-hadoop2.7.tgz
  29. # create folder for spark
  30. sudo mkdir /usr/local/spark
  31. # decompress in new folder
  32. sudo tar xvf spark-2.4.3-bin-hadoop2.7.tgz -C /usr/local/spark
  33. # add path
  34. sudo nano ~/.bashrc
  35. # Copy and paste code below and save
  36. SPARK_HOME=/usr/local/spark/spark-2.4.3-bin-hadoop2.7
  37. export PATH=$SPARK_HOME/bin:$PATH
  38. #check spark and enjoy
  39. spark-shell
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement