Advertisement
gronke

temp code

May 22nd, 2014
187
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
Python 0.46 KB | None | 0 0
  1. import requests
  2. import lxml.html
  3. import cssselect
  4. import csv
  5.  
  6. req = requests.get('http://en.wikipedia.org/wiki/List_of_AZA_member_zoos_and_aquaria')
  7. root = lxml.html.fromstring(req.text)
  8. b = root.cssselect('table:first-of-type tr')
  9. c = []
  10.  
  11. urls = []
  12.  
  13. for row in b[1:]:
  14.     cells = row.cssselect('tr')
  15.     c.append(cells[0].text_content().strip())
  16.     sites = row.cssselect('a href')
  17.         links = cells[0].cssselect('a')
  18.         urls.append(links[3].get('href'))
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement