Untitled

a guest
Jul 14th, 2012
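from bs4 import BeautifulSoup  # Python 3 port; the paste targeted Python 2

def cleanHtml(dom):
    # Mark every link rel="nofollow".
    for a in dom.find_all('a'):
        a['rel'] = "nofollow"

    # Drop tags that can run code or restyle the page. A plain loop replaces
    # map(), which is lazy on Python 3 and would never call extract().
    for tag in dom.find_all(['script', 'object', 'link', 'style']):
        tag.extract()
    return str(dom)

# untrusted_html (not defined in this paste) is the HTML string to sanitize.
dom = BeautifulSoup(untrusted_html, 'html.parser')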
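
The paste never shows the function being called, so here is a minimal usage sketch. It assumes Python 3 with the bs4 package and the stdlib "html.parser" backend; the name clean_html, the choice to parse inside the function, and the sample string are all made up for illustration and are not part of the original paste.

```python
from bs4 import BeautifulSoup


def clean_html(html):
    """Parse html, add rel="nofollow" to links, and drop risky tags."""
    dom = BeautifulSoup(html, "html.parser")

    # Tag every anchor so the links carry rel="nofollow".
    for a in dom.find_all("a"):
        a["rel"] = "nofollow"

    # Remove the same four tag types as the paste, contents included.
    for tag in dom.find_all(["script", "object", "link", "style"]):
        tag.extract()

    return str(dom)


# Hypothetical input, made up for illustration.
sample = '<p>Hi <a href="http://example.com">link</a><script>alert(1)</script></p>'
cleaned = clean_html(sample)
print(cleaned)
```

Note that this only removes the four listed tags. Inline event-handler attributes such as onclick and javascript: URLs in href pass through unchanged, so it is a partial filter rather than a full sanitizer.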