Advertisement
Not a member of Pastebin yet?
Sign Up,
it unlocks many cool features!
- import html5lib
- from lxml.cssselect import CSSSelector
- selector = CSSSelector("#article-content h1")
- doc = html5lib.parse('<div id="article-content"><h1>Text here</h1></div>', treebuilder="lxml")
- print [e.text for e in selector(doc)]
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement