Untitled

a guest
Oct 25th, 2014
Traceback (most recent call last):
  File "run_urltests.py", line 28, in <module>
    random=arguments['--random']
  File "/Users/julielavoie/work/sawhorse/bylines/tests/performance.py", line 23, in wrapper
    fn(*args, **kwargs)
  File "/Users/julielavoie/work/sawhorse/bylines/tests/performance.py", line 193, in test_extractor_performance_on_html
    'extracted': call_extractor_on_file(extractor, url)
  File "/Users/julielavoie/work/sawhorse/bylines/tests/performance.py", line 174, in call_extractor_on_file
    result = extractor(filename)
  File "/Users/julielavoie/work/sawhorse/bylines/bylines/extractor.py", line 98, in find_authors_from_file
    authors = find_authors_from_html(content)
  File "/Users/julielavoie/work/sawhorse/bylines/bylines/extractor.py", line 131, in find_authors_from_html
    authors = find_authors_from_lxml_tree(lxml.html.fromstring(html))
  File "/Users/julielavoie/work/sawhorse/bylines/bylines/extractor.py", line 259, in find_authors_from_lxml_tree
    element = drop_tags(element, DROP_TAGS)
  File "/Users/julielavoie/work/sawhorse/bylines/bylines/extractor.py", line 246, in drop_tags
    tag.drop_tree()
  File "/Library/Python/2.7/site-packages/lxml/html/__init__.py", line 199, in drop_tree
    if self.tail:
  File "lxml.etree.pyx", line 931, in lxml.etree._Element.tail.__get__ (src/lxml/lxml.etree.c:41551)
  File "apihelpers.pxi", line 620, in lxml.etree._collectText (src/lxml/lxml.etree.c:18637)
  File "apihelpers.pxi", line 1322, in lxml.etree.funicode (src/lxml/lxml.etree.c:24615)
UnicodeDecodeError: 'utf8' codec can't decode byte 0x92 in position 33: invalid start byte
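
Note on the final error: byte 0x92 cannot start a UTF-8 sequence, but in Windows-1252 it is the right single quotation mark (U+2019). One plausible reading, not confirmed by the traceback itself, is that the HTML reached lxml as raw bytes in a Windows-1252-like encoding. A minimal sketch of decoding the bytes to text before parsing follows; `decode_html_bytes` and the cp1252 fallback are assumptions for illustration, not part of the bylines code.

```python
def decode_html_bytes(raw, fallback="cp1252"):
    """Hypothetical helper: turn raw HTML bytes into text before parsing.

    Tries UTF-8 first; if that fails, falls back to an assumed legacy
    encoding (cp1252 here, a guess based on the 0x92 byte).
    """
    try:
        return raw.decode("utf-8")
    except UnicodeDecodeError:
        # errors="replace" covers the few byte values cp1252 leaves undefined.
        return raw.decode(fallback, errors="replace")


# A made-up fragment containing the same offending byte as the traceback.
sample = b"<p>It\x92s a byline</p>"
text = decode_html_bytes(sample)
```

The resulting text could then be handed to `lxml.html.fromstring` in place of the raw bytes. Whether cp1252 is the right fallback depends on the actual source pages, which the paste does not show.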