Traceback (most recent call last):
  File "run_urltests.py", line 28, in <module>
    random=arguments['--random']
  File "/Users/julielavoie/work/sawhorse/bylines/tests/performance.py", line 23, in wrapper
    fn(*args, **kwargs)
  File "/Users/julielavoie/work/sawhorse/bylines/tests/performance.py", line 193, in test_extractor_performance_on_html
    'extracted': call_extractor_on_file(extractor, url)
  File "/Users/julielavoie/work/sawhorse/bylines/tests/performance.py", line 174, in call_extractor_on_file
    result = extractor(filename)
  File "/Users/julielavoie/work/sawhorse/bylines/bylines/extractor.py", line 98, in find_authors_from_file
    authors = find_authors_from_html(content)
  File "/Users/julielavoie/work/sawhorse/bylines/bylines/extractor.py", line 131, in find_authors_from_html
    authors = find_authors_from_lxml_tree(lxml.html.fromstring(html))
  File "/Users/julielavoie/work/sawhorse/bylines/bylines/extractor.py", line 259, in find_authors_from_lxml_tree
    element = drop_tags(element, DROP_TAGS)
  File "/Users/julielavoie/work/sawhorse/bylines/bylines/extractor.py", line 246, in drop_tags
    tag.drop_tree()
  File "/Library/Python/2.7/site-packages/lxml/html/__init__.py", line 199, in drop_tree
    if self.tail:
  File "lxml.etree.pyx", line 931, in lxml.etree._Element.tail.__get__ (src/lxml/lxml.etree.c:41551)
  File "apihelpers.pxi", line 620, in lxml.etree._collectText (src/lxml/lxml.etree.c:18637)
  File "apihelpers.pxi", line 1322, in lxml.etree.funicode (src/lxml/lxml.etree.c:24615)
UnicodeDecodeError: 'utf8' codec can't decode byte 0x92 in position 33: invalid start byte
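
Byte 0x92 is the right single quotation mark in Windows-1252, so one plausible cause (an assumption, the traceback does not confirm it) is that the page was not really UTF-8. Below is a minimal sketch of decoding the raw bytes to text before they reach the parser. The helper name `decode_html_bytes` and the encoding order are hypothetical and not part of the bylines code.

```python
def decode_html_bytes(raw, encodings=("utf-8", "cp1252")):
    """Decode raw page bytes, trying each candidate encoding in order.

    Hypothetical helper: the candidate list is an assumption, not
    something taken from the project in the traceback.
    """
    for encoding in encodings:
        try:
            return raw.decode(encoding)
        except UnicodeDecodeError:
            # This encoding rejected the bytes; try the next candidate.
            continue
    # Last resort: keep going, marking undecodable bytes with U+FFFD.
    return raw.decode("utf-8", "replace")


# 0x92 is not a valid UTF-8 start byte, so the cp1252 fallback is used.
text = decode_html_bytes(b"It\x92s a byline")
```

The decoded text could then be passed on in place of the raw bytes, if the input really is mis-labelled Windows-1252.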