Guest User

Untitled

a guest
Jan 23rd, 2018
64
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
text 0.98 KB | None | 0 0
  1. group_id: xxx
  2. medicore_id: xxxxxxx
  3. date_of_visit_sample_drawn_date: xxxxxxx
  4. rin: xxxxxx
  5. donor_id: xxxxx
  6. sle_visit_designation: xxxxxxx
  7. bold_shipment_batch: xxxxxx
  8. rna_concentrated: xxxxxx
  9. subject_type: xxxxxxx
  10.  
  11. from bs4 import BeautifulSoup
  12. import requests
  13.  
  14. source= requests.get("https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?
  15. acc=GSM2437275").text
  16.  
  17.  
  18. soup = BeautifulSoup(source, 'lxml')
  19. table = soup.find_all('table')
  20. for i in table:
  21. print i.prettify()
  22. print (len(table)) #22 tables
  23.  
  24. print (table[6].prettify()) #narrow down on relevant table
  25. table = table[6]
  26.  
  27. table_subtables = table.find_all('table')
  28. for i in table_subtables:
  29. print (i.prettify())
  30.  
  31. print len(table_subtables) #14 tables
  32.  
  33. tbb = table_subtables[1]
  34.  
  35. tbb_subtable = tbb.find_all('table')
  36. for i in tbb_subtable:
  37. print (i.prettify())
  38. print len(tbb_subtable) #12 tables
  39.  
  40. tbbb = tbb_subtable[5]
  41.  
  42. tbbb_subtable = tbbb.find_all('table')
  43. for i in tbbb_subtable:
  44. print (i.prettify())
  45. print len(tbbb_subtable) # 6 tables
Add Comment
Please, Sign In to add comment