scrapy

a guest
Feb 26th, 2017
Erebos:scraper andreas$ scrapy shell -s USER_AGENT="Mozilla/5.0 (Macintosh; Intel Mac OS X 10_12_3) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/56.0.2924.87 Safari/537.36" http://www.firmenabc.at/result.aspx?what=&where=Graz
[1] 26354
Erebos:scraper andreas$ 2017-02-26 14:47:34 [scrapy.utils.log] INFO: Scrapy 1.3.2 started (bot: scrapybot)
2017-02-26 14:47:34 [scrapy.utils.log] INFO: Overridden settings: {'DUPEFILTER_CLASS': 'scrapy.dupefilters.BaseDupeFilter', 'LOGSTATS_INTERVAL': 0, 'USER_AGENT': 'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_12_3) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/56.0.2924.87 Safari/537.36'}
2017-02-26 14:47:34 [scrapy.middleware] INFO: Enabled extensions:
['scrapy.extensions.corestats.CoreStats',
 'scrapy.extensions.telnet.TelnetConsole']
2017-02-26 14:47:34 [scrapy.middleware] INFO: Enabled downloader middlewares:
['scrapy.downloadermiddlewares.httpauth.HttpAuthMiddleware',
 'scrapy.downloadermiddlewares.downloadtimeout.DownloadTimeoutMiddleware',
 'scrapy.downloadermiddlewares.defaultheaders.DefaultHeadersMiddleware',
 'scrapy.downloadermiddlewares.useragent.UserAgentMiddleware',
 'scrapy.downloadermiddlewares.retry.RetryMiddleware',
 'scrapy.downloadermiddlewares.redirect.MetaRefreshMiddleware',
 'scrapy.downloadermiddlewares.httpcompression.HttpCompressionMiddleware',
 'scrapy.downloadermiddlewares.redirect.RedirectMiddleware',
 'scrapy.downloadermiddlewares.cookies.CookiesMiddleware',
 'scrapy.downloadermiddlewares.stats.DownloaderStats']
2017-02-26 14:47:34 [scrapy.middleware] INFO: Enabled spider middlewares:
['scrapy.spidermiddlewares.httperror.HttpErrorMiddleware',
 'scrapy.spidermiddlewares.offsite.OffsiteMiddleware',
 'scrapy.spidermiddlewares.referer.RefererMiddleware',
 'scrapy.spidermiddlewares.urllength.UrlLengthMiddleware',
 'scrapy.spidermiddlewares.depth.DepthMiddleware']
2017-02-26 14:47:34 [scrapy.middleware] INFO: Enabled item pipelines:
[]
2017-02-26 14:47:34 [scrapy.extensions.telnet] DEBUG: Telnet console listening on 127.0.0.1:6023
2017-02-26 14:47:34 [scrapy.core.engine] INFO: Spider opened
2017-02-26 14:47:34 [scrapy.downloadermiddlewares.retry] DEBUG: Retrying <GET http://www.firmenabc.at/result.aspx?what=> (failed 1 times): [<twisted.python.failure.Failure twisted.internet.error.ConnectionLost: Connection to the other side was lost in a non-clean fashion.>]
2017-02-26 14:47:35 [scrapy.downloadermiddlewares.retry] DEBUG: Retrying <GET http://www.firmenabc.at/result.aspx?what=> (failed 2 times): [<twisted.python.failure.Failure twisted.internet.error.ConnectionLost: Connection to the other side was lost in a non-clean fashion.>]
2017-02-26 14:47:35 [scrapy.downloadermiddlewares.retry] DEBUG: Gave up retrying <GET http://www.firmenabc.at/result.aspx?what=> (failed 3 times): [<twisted.python.failure.Failure twisted.internet.error.ConnectionLost: Connection to the other side was lost in a non-clean fashion.>]
Traceback (most recent call last):
  File "/usr/local/bin/scrapy", line 11, in <module>
    sys.exit(execute())
  File "/usr/local/lib/python3.6/site-packages/scrapy/cmdline.py", line 142, in execute
    _run_print_help(parser, _run_command, cmd, args, opts)
  File "/usr/local/lib/python3.6/site-packages/scrapy/cmdline.py", line 88, in _run_print_help
    func(*a, **kw)
  File "/usr/local/lib/python3.6/site-packages/scrapy/cmdline.py", line 149, in _run_command
    cmd.run(args, opts)
  File "/usr/local/lib/python3.6/site-packages/scrapy/commands/shell.py", line 73, in run
    shell.start(url=url, redirect=not opts.no_redirect)
  File "/usr/local/lib/python3.6/site-packages/scrapy/shell.py", line 48, in start
    self.fetch(url, spider, redirect=redirect)
  File "/usr/local/lib/python3.6/site-packages/scrapy/shell.py", line 115, in fetch
    reactor, self._schedule, request, spider)
  File "/usr/local/lib/python3.6/site-packages/twisted/internet/threads.py", line 122, in blockingCallFromThread
    result.raiseException()
  File "/usr/local/lib/python3.6/site-packages/twisted/python/failure.py", line 372, in raiseException
    raise self.value.with_traceback(self.tb)
twisted.web._newclient.ResponseNeverReceived: [<twisted.python.failure.Failure twisted.internet.error.ConnectionLost: Connection to the other side was lost in a non-clean fashion.>]
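
Note on the command line: the URL is unquoted, so the shell treats the `&` as its background operator. That matches the `[1] 26354` job line and the retry messages, which request `result.aspx?what=` without `&where=Graz`. Below is a minimal sketch of the same invocation with the URL quoted. The variable names and the `DRY_RUN` guard are hypothetical additions for illustration and are not part of the original session. Quoting only ensures Scrapy receives the full URL; whether the server then answers instead of dropping the connection is not something this log can confirm.

```shell
# Single quotes keep '&' literal, so the whole query string reaches scrapy.
url='http://www.firmenabc.at/result.aspx?what=&where=Graz'
user_agent='Mozilla/5.0 (Macintosh; Intel Mac OS X 10_12_3) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/56.0.2924.87 Safari/537.36'

# Show the exact URL argument before launching anything.
printf 'fetching: %s\n' "$url"

# DRY_RUN is a hypothetical guard: by default nothing is launched.
# Set DRY_RUN=0 to actually start the Scrapy shell.
if [ "${DRY_RUN:-1}" = "0" ] && command -v scrapy >/dev/null 2>&1; then
    scrapy shell -s USER_AGENT="$user_agent" "$url"
fi
```

With `DRY_RUN=0`, the retry or crawl lines in the log should show the full `...?what=&where=Graz` URL, and no `[1] <pid>` job line should appear.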