- RDD example (Java)
- ==================================
- http://www.agildata.com/apache-spark-rdd-vs-dataframe-vs-dataset/
- rdd.filter(p -> p.getAge() < 21)
- .map(p -> p.getLast())
- .saveAsObjectFile("under21.bin");
- DataFrame example (Scala)
- ==================================
- https://databricks.com/blog/2016/07/14/a-tale-of-three-apache-spark-apis-rdds-dataframes-and-datasets.html
- Computes the average temperature and humidity per country, for readings where the temperature exceeds 25 degrees.
- val dsAvgTmp = ds.filter(d => {d.temp > 25})
- .map(d => (d.temp, d.humidity, d.cca3))
- .groupBy($"_3")
- .avg()
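To check what the filter / group / average pipeline above should return without a Spark session, the same logic can be sketched with plain Java streams. This is not Spark code. The `AvgByCountry` class, the `Reading` type, and the sample values are hypothetical stand-ins for the device records in the Scala snippet.

```java
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.stream.Collectors;

public class AvgByCountry {
    // Hypothetical stand-in for one device record (country code, temperature, humidity).
    static final class Reading {
        final String cca3;
        final double temp;
        final double humidity;

        Reading(String cca3, double temp, double humidity) {
            this.cca3 = cca3;
            this.temp = temp;
            this.humidity = humidity;
        }
    }

    // Keeps readings with temp > 25, groups them by country code, and returns
    // country code -> {average temperature, average humidity}.
    static Map<String, double[]> averagesAbove25(List<Reading> readings) {
        return readings.stream()
            .filter(r -> r.temp > 25)
            .collect(Collectors.groupingBy(
                r -> r.cca3,
                TreeMap::new,
                Collectors.collectingAndThen(Collectors.toList(), rs -> new double[] {
                    rs.stream().mapToDouble(r -> r.temp).average().orElse(Double.NaN),
                    rs.stream().mapToDouble(r -> r.humidity).average().orElse(Double.NaN)
                })));
    }

    public static void main(String[] args) {
        // Made-up sample data, for illustration only.
        List<Reading> sample = List.of(
            new Reading("ESP", 30, 40),
            new Reading("ESP", 28, 60),
            new Reading("ESP", 20, 90),
            new Reading("USA", 26, 50),
            new Reading("NOR", 10, 70));

        averagesAbove25(sample).forEach((country, avg) ->
            System.out.println(country + " temp=" + avg[0] + " humidity=" + avg[1]));
    }
}
```

Countries whose readings all fall at or below 25 degrees do not appear in the result, which matches filtering before grouping in the Spark version.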