Untitled

a guest
Jul 1st, 2016
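import numpy  # pyspark.ml requires NumPy to be installed
from pyspark import SparkContext
from pyspark.sql import SQLContext
from pyspark.ml.feature import StopWordsRemover

# createDataFrame is an instance method, so build a SQLContext from a SparkContext first
sc = SparkContext.getOrCreate()
sqlContext = SQLContext(sc)

# Each row holds a label and a pre-tokenized sentence
sentenceData = sqlContext.createDataFrame([
    (0, ["I", "saw", "the", "red", "baloon"]),
    (1, ["Mary", "had", "a", "little", "lamb"])
], ["label", "raw"])

# Drop the default English stop words from "raw" and write the rest to "filtered"
remover = StopWordsRemover(inputCol="raw", outputCol="filtered")
remover.transform(sentenceData).show(truncate=False)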