Advertisement
Guest User

Untitled

a guest
Jan 9th, 2011
318
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
Python 0.23 KB | None | 0 0
  1. import html5lib
  2. from lxml.cssselect import CSSSelector
  3. selector = CSSSelector("#article-content h1")
  4. doc = html5lib.parse('<div id="article-content"><h1>Text here</h1></div>', treebuilder="lxml")
  5. print [e.text for e in selector(doc)]
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement