Guest User

Untitled

a guest
Feb 21st, 2018
58
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
text 1.25 KB | None | 0 0
  1. user document
  2. 0 john book
  3. 1 jane article
  4. 2 jane book
  5. 3 jane book
  6. 4 jim article
  7. 5 john book
  8. 6 jim blogpost
  9. 7 jane blogpost
  10. 8 jane blogpost
  11. 9 jane blogpost
  12.  
  13. blogpost article book
  14. john 1 3 0
  15. jane 0 0 1
  16. jim 4 0 2
  17.  
  18. df = pd.DataFrame(index=users, columns=documents)
  19. df.fillna(0, inplace=True)
  20.  
  21. grouped = records.groupby(['user', 'document'])
  22. for elem in grouped:
  23. user, document = elem[0]
  24. downloads = len(elem[1])
  25. df.loc[user, document] = downloads
  26.  
  27. records = pd.DataFrame([
  28. ('john', 'book'),
  29. ('jane', 'article'),
  30. ('jane','book'),
  31. ('jane','book'),
  32. ('jim', 'article'),
  33. ('john', 'book'),
  34. ('jim', 'blogpost'),
  35. ('jane', 'blogpost'),
  36. ('jane', 'blogpost'),
  37. ('jane', 'blogpost')
  38. ], columns=['user', 'document'])
  39. print(records)
  40.  
  41. users = list(set(records['user']))
  42. users.sort()
  43. documents = list(set(records['document']))
  44. documents.sort()
  45.  
  46. print(users)
  47. print(documents)
  48.  
  49. df = pd.DataFrame(index=users, columns=documents)
  50. df.fillna(0, inplace=True)
  51. print(df)
  52.  
  53. grouped = records.groupby(['user', 'document'])
  54. for elem in grouped:
  55. user, document = elem[0]
  56. downloads = len(elem[1])
  57. df.loc[user, document] = downloads
Add Comment
Please, Sign In to add comment