Advertisement
Guest User

Untitled

a guest
Jun 16th, 2019
127
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
text 0.58 KB | None | 0 0
  1. df = spark.createDataFrame([(["c", "b", "a","e","f"],'a')], ['arraydata','item'])
  2.  
  3. df.select(df.arraydata, array_position(f.col("arraydata"),'a')).show()
  4.  
  5. +---------------+----------------------------+
  6. | arraydata|array_position(arraydata, a)|
  7. +---------------+----------------------------+
  8. |[c, b, a, e, f]| 3|
  9. +---------------+----------------------------+
  10.  
  11. from pyspark.sql.functions import array_position
  12. df.select(df.arraydata, array_position(f.col("arraydata"),f.col("item"))).show()
  13.  
  14. TypeError: Column is not iterable
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement