Guest User


a guest
Aug 20th, 2019
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
  1. """
  2. Each element is a dictionary of attributes and their values for a post.
  3. Can you find the set of all attributes used throughout the RDD?
  4. The function dictionary.keys() gives you the list of attributes of a dictionary.
  5. :param dataset: dataset loaded in Spark context
  6. :type dataset: a Spark RDD
  7. :return: all unique attributes collected in a list
  8. """
RAW Paste Data