Advertisement
stuppid_bot

Parse sitemap.xml via ElementTree on python

Jul 5th, 2014
231
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
Python 0.45 KB | None | 0 0
  1. # -*- coding: utf8 -*-
  2. import urllib
  3. import xml.etree.ElementTree as ET
  4. import re
  5.  
  6. def get_page(u):
  7.     h = urllib.urlopen(u)
  8.     # print h.info()
  9.     d = h.read()
  10.     return d.decode('utf8')
  11.  
  12. if __name__ == '__main__':
  13.     content = get_page('http://kino-max.com/sitemap.xml')
  14.     root = ET.fromstring(content)
  15.     ns = root.tag[:root.tag.find('}') + 1]
  16.     for el in root:
  17.         print el.find(ns + 'loc').text
  18.         break
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement