Untitled

a guest
May 18th, 2012
/usr/lib/python2.6/site-packages/bs4/builder/_htmlparser.py:149: RuntimeWarning: Python's built-in HTMLParser cannot parse the given document. This is not a bug in Beautiful Soup. The best solution is to install an external parser (lxml or html5lib), and use Beautiful Soup with that parser. See http://www.crummy.com/software/BeautifulSoup/bs4/doc/#installing-a-parser for help.
  "Python's built-in HTMLParser cannot parse the given document. This is not a bug in Beautiful Soup. The best solution is to install an external parser (lxml or html5lib), and use Beautiful Soup with that parser. See http://www.crummy.com/software/BeautifulSoup/bs4/doc/#installing-a-parser for help."))
Traceback (most recent call last):
  File "/usr/lib/python2.6/site-packages/eventlet-0.9.16-py2.6.egg/eventlet/hubs/poll.py", line 97, in wait
    readers.get(fileno, noop).cb(fileno)
  File "/usr/lib/python2.6/site-packages/eventlet-0.9.16-py2.6.egg/eventlet/greenthread.py", line 192, in main
    result = function(*args, **kwargs)
  File "crawl.py", line 29, in retrieve_links
    b = BeautifulSoup(src)
  File "/usr/lib/python2.6/site-packages/bs4/__init__.py", line 172, in __init__
    self._feed()
  File "/usr/lib/python2.6/site-packages/bs4/__init__.py", line 185, in _feed
    self.builder.feed(self.markup)
  File "/usr/lib/python2.6/site-packages/bs4/builder/_htmlparser.py", line 150, in feed
    raise e
HTMLParseError: bad end tag: u'</a";\n\t\t\t\t}\n\t\t\t\tadNode += "</div>', at line 130, column 151
Removing descriptor: 3
Traceback (most recent call last):
  File "crawl.py", line 78, in <module>
    begin_crawling()
  File "crawl.py", line 71, in begin_crawling
    data = crawl()
  File "crawl.py", line 56, in crawl
    for link in green_pool.imap(retrieve_links, crawl_queue):
  File "/usr/lib/python2.6/site-packages/eventlet-0.9.16-py2.6.egg/eventlet/greenpool.py", line 232, in next
    val = self.waiters.get().wait()
  File "/usr/lib/python2.6/site-packages/eventlet-0.9.16-py2.6.egg/eventlet/greenthread.py", line 166, in wait
    return self._exit_event.wait()
  File "/usr/lib/python2.6/site-packages/eventlet-0.9.16-py2.6.egg/eventlet/event.py", line 116, in wait
    return hubs.get_hub().switch()
  File "/usr/lib/python2.6/site-packages/eventlet-0.9.16-py2.6.egg/eventlet/hubs/hub.py", line 177, in switch
    return self.greenlet.switch()
  File "/usr/lib/python2.6/site-packages/eventlet-0.9.16-py2.6.egg/eventlet/greenthread.py", line 192, in main
    result = function(*args, **kwargs)
  File "crawl.py", line 29, in retrieve_links
    b = BeautifulSoup(src)
  File "/usr/lib/python2.6/site-packages/bs4/__init__.py", line 172, in __init__
    self._feed()
  File "/usr/lib/python2.6/site-packages/bs4/__init__.py", line 185, in _feed
    self.builder.feed(self.markup)
  File "/usr/lib/python2.6/site-packages/bs4/builder/_htmlparser.py", line 150, in feed
    raise e
HTMLParser.HTMLParseError: bad end tag: u'</a";\n\t\t\t\t}\n\t\t\t\tadNode += "</div>', at line 130, column 151
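
The RuntimeWarning at the top of this log suggests installing an external parser (lxml or html5lib) and passing it to Beautiful Soup explicitly, instead of calling `BeautifulSoup(src)` with the default. The following is a minimal sketch of that idea, not the code from crawl.py: it assumes a current Python 3 with bs4 installed, and the names `choose_parser` and `retrieve_links_safely` are hypothetical. It also catches parse failures per page, so one malformed document would be skipped rather than aborting the whole `green_pool.imap` loop.

```python
import importlib.util


def choose_parser(candidates=("lxml", "html5lib")):
    """Return the first installed external parser, else the built-in one."""
    for name in candidates:
        # find_spec returns None when a top-level module is not installed.
        if importlib.util.find_spec(name) is not None:
            return name
    return "html.parser"


def retrieve_links_safely(src):
    """Hypothetical stand-in for retrieve_links in crawl.py (an assumption)."""
    # Imported lazily so choose_parser can be used without bs4 installed.
    from bs4 import BeautifulSoup

    try:
        soup = BeautifulSoup(src, choose_parser())
    except Exception as exc:
        # Broad on purpose: the exception type depends on parser and version.
        print("skipping unparseable page: %s" % exc)
        return []
    # Collect the href of every anchor tag that has one.
    return [a["href"] for a in soup.find_all("a", href=True)]
```

Whether an external parser accepts the exact markup that failed here (a `</a` end tag embedded in an inline script string) is not shown by the log, so the `try`/`except` is kept as a safety net either way.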