- 2017-07-26 09:38:37 [scrapy.utils.log] INFO: Scrapy 1.4.0 started (bot: Thesis)
- 2017-07-26 09:38:37 [scrapy.utils.log] INFO: Overridden settings: {'NEWSPIDER_MODULE': 'Thesis.spiders', 'CONCURRENT_REQUESTS': 200, 'SPIDER_MODULES': ['Thesis.spiders'], 'BOT_NAME': 'Thesis', 'CONCURRENT_ITEMS': 400, 'COOKIES_ENABLED': False, 'USER_AGENT': 'Mozilla/5.0 (Windows NT 6.3; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/59.0.3071.115 Safari/537.36', 'DOWNLOAD_DELAY': 4}
- 2017-07-26 09:38:37 [scrapy.middleware] INFO: Enabled extensions:
- ['scrapy.extensions.logstats.LogStats',
- 'scrapy.extensions.telnet.TelnetConsole',
- 'scrapy.extensions.corestats.CoreStats']
- 2017-07-26 09:38:40 [selenium.webdriver.remote.remote_connection] DEBUG: POST http://127.0.0.1:52986/wd/hub/session {"capabilities": {"alwaysMatch": {"platform": "ANY", "browserName": "phantomjs", "version": "", "javascriptEnabled": true}, "firstMatch": []}, "desiredCapabilities": {"platform": "ANY", "browserName": "phantomjs", "version": "", "javascriptEnabled": true}}
- 2017-07-26 09:38:40 [selenium.webdriver.remote.remote_connection] DEBUG: Finished Request
- 2017-07-26 09:38:40 [selenium.webdriver.remote.remote_connection] DEBUG: POST http://127.0.0.1:52986/wd/hub/session/71437af0-71d5-11e7-83de-29eb7aea97e9/window/current/size {"width": 1120, "windowHandle": "current", "sessionId": "71437af0-71d5-11e7-83de-29eb7aea97e9", "height": 550}
- 2017-07-26 09:38:40 [selenium.webdriver.remote.remote_connection] DEBUG: Finished Request
- 2017-07-26 09:38:40 [scrapy.middleware] INFO: Enabled downloader middlewares:
- ['scrapy.downloadermiddlewares.httpauth.HttpAuthMiddleware',
- 'scrapy.downloadermiddlewares.downloadtimeout.DownloadTimeoutMiddleware',
- 'scrapy.downloadermiddlewares.defaultheaders.DefaultHeadersMiddleware',
- 'scrapy.downloadermiddlewares.useragent.UserAgentMiddleware',
- 'scrapy.downloadermiddlewares.retry.RetryMiddleware',
- 'scrapy.downloadermiddlewares.redirect.MetaRefreshMiddleware',
- 'scrapy.downloadermiddlewares.httpcompression.HttpCompressionMiddleware',
- 'scrapy.downloadermiddlewares.redirect.RedirectMiddleware',
- 'scrapy.downloadermiddlewares.httpproxy.HttpProxyMiddleware',
- 'scrapy.downloadermiddlewares.stats.DownloaderStats']
- 2017-07-26 09:38:40 [scrapy.middleware] INFO: Enabled spider middlewares:
- ['scrapy.spidermiddlewares.httperror.HttpErrorMiddleware',
- 'scrapy.spidermiddlewares.offsite.OffsiteMiddleware',
- 'scrapy.spidermiddlewares.referer.RefererMiddleware',
- 'scrapy.spidermiddlewares.urllength.UrlLengthMiddleware',
- 'scrapy.spidermiddlewares.depth.DepthMiddleware']
- 2017-07-26 09:38:40 [scrapy.middleware] INFO: Enabled item pipelines:
- ['Thesis.pipelines.PcworldPipeline']
- 2017-07-26 09:38:40 [scrapy.core.engine] INFO: Spider opened
- 2017-07-26 09:38:40 [scrapy.extensions.logstats] INFO: Crawled 0 pages (at 0 pages/min), scraped 0 items (at 0 items/min)
- 2017-07-26 09:38:40 [scrapy.extensions.telnet] DEBUG: Telnet console listening on 127.0.0.1:6023
- 2017-07-26 09:38:43 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.pcworld.com/search?query=heartbleed> (referer: None)
- 2017-07-26 09:38:43 [selenium.webdriver.remote.remote_connection] DEBUG: POST http://127.0.0.1:52986/wd/hub/session/71437af0-71d5-11e7-83de-29eb7aea97e9/url {"url": "http://www.pcworld.com/search?query=heartbleed", "sessionId": "71437af0-71d5-11e7-83de-29eb7aea97e9"}
- 2017-07-26 09:39:00 [selenium.webdriver.remote.remote_connection] DEBUG: Finished Request
- 2017-07-26 09:39:00 [selenium.webdriver.remote.remote_connection] DEBUG: POST http://127.0.0.1:52986/wd/hub/session/71437af0-71d5-11e7-83de-29eb7aea97e9/element {"using": "class name", "sessionId": "71437af0-71d5-11e7-83de-29eb7aea97e9", "value": "excerpt-text"}
- 2017-07-26 09:39:00 [selenium.webdriver.remote.remote_connection] DEBUG: Finished Request
- 2017-07-26 09:39:00 [selenium.webdriver.remote.remote_connection] DEBUG: POST http://127.0.0.1:52986/wd/hub/session/71437af0-71d5-11e7-83de-29eb7aea97e9/elements {"using": "xpath", "sessionId": "71437af0-71d5-11e7-83de-29eb7aea97e9", "value": "//div[@class=\"excerpt-text\"]/h3/a"}
- 2017-07-26 09:39:00 [selenium.webdriver.remote.remote_connection] DEBUG: Finished Request
- 2017-07-26 09:39:00 [selenium.webdriver.remote.remote_connection] DEBUG: GET http://127.0.0.1:52986/wd/hub/session/71437af0-71d5-11e7-83de-29eb7aea97e9/element/:wdc:1501054740524/attribute/href {"sessionId": "71437af0-71d5-11e7-83de-29eb7aea97e9", "name": "href", "id": ":wdc:1501054740524"}
- 2017-07-26 09:39:00 [selenium.webdriver.remote.remote_connection] DEBUG: Finished Request
- 2017-07-26 09:39:00 [selenium.webdriver.remote.remote_connection] DEBUG: GET http://127.0.0.1:52986/wd/hub/session/71437af0-71d5-11e7-83de-29eb7aea97e9/element/:wdc:1501054740525/attribute/href {"sessionId": "71437af0-71d5-11e7-83de-29eb7aea97e9", "name": "href", "id": ":wdc:1501054740525"}
- 2017-07-26 09:39:00 [selenium.webdriver.remote.remote_connection] DEBUG: Finished Request
- 2017-07-26 09:39:00 [selenium.webdriver.remote.remote_connection] DEBUG: GET http://127.0.0.1:52986/wd/hub/session/71437af0-71d5-11e7-83de-29eb7aea97e9/element/:wdc:1501054740526/attribute/href {"sessionId": "71437af0-71d5-11e7-83de-29eb7aea97e9", "name": "href", "id": ":wdc:1501054740526"}
- 2017-07-26 09:39:00 [selenium.webdriver.remote.remote_connection] DEBUG: Finished Request
- 2017-07-26 09:39:00 [selenium.webdriver.remote.remote_connection] DEBUG: GET http://127.0.0.1:52986/wd/hub/session/71437af0-71d5-11e7-83de-29eb7aea97e9/element/:wdc:1501054740527/attribute/href {"sessionId": "71437af0-71d5-11e7-83de-29eb7aea97e9", "name": "href", "id": ":wdc:1501054740527"}
- 2017-07-26 09:39:00 [selenium.webdriver.remote.remote_connection] DEBUG: Finished Request
- 2017-07-26 09:39:00 [selenium.webdriver.remote.remote_connection] DEBUG: GET http://127.0.0.1:52986/wd/hub/session/71437af0-71d5-11e7-83de-29eb7aea97e9/element/:wdc:1501054740528/attribute/href {"sessionId": "71437af0-71d5-11e7-83de-29eb7aea97e9", "name": "href", "id": ":wdc:1501054740528"}
- 2017-07-26 09:39:00 [selenium.webdriver.remote.remote_connection] DEBUG: Finished Request
- 2017-07-26 09:39:00 [selenium.webdriver.remote.remote_connection] DEBUG: GET http://127.0.0.1:52986/wd/hub/session/71437af0-71d5-11e7-83de-29eb7aea97e9/element/:wdc:1501054740529/attribute/href {"sessionId": "71437af0-71d5-11e7-83de-29eb7aea97e9", "name": "href", "id": ":wdc:1501054740529"}
- 2017-07-26 09:39:00 [selenium.webdriver.remote.remote_connection] DEBUG: Finished Request
- 2017-07-26 09:39:00 [selenium.webdriver.remote.remote_connection] DEBUG: GET http://127.0.0.1:52986/wd/hub/session/71437af0-71d5-11e7-83de-29eb7aea97e9/element/:wdc:1501054740530/attribute/href {"sessionId": "71437af0-71d5-11e7-83de-29eb7aea97e9", "name": "href", "id": ":wdc:1501054740530"}
- 2017-07-26 09:39:00 [selenium.webdriver.remote.remote_connection] DEBUG: Finished Request
- 2017-07-26 09:39:00 [selenium.webdriver.remote.remote_connection] DEBUG: GET http://127.0.0.1:52986/wd/hub/session/71437af0-71d5-11e7-83de-29eb7aea97e9/element/:wdc:1501054740531/attribute/href {"sessionId": "71437af0-71d5-11e7-83de-29eb7aea97e9", "name": "href", "id": ":wdc:1501054740531"}
- 2017-07-26 09:39:00 [selenium.webdriver.remote.remote_connection] DEBUG: Finished Request
- 2017-07-26 09:39:00 [selenium.webdriver.remote.remote_connection] DEBUG: GET http://127.0.0.1:52986/wd/hub/session/71437af0-71d5-11e7-83de-29eb7aea97e9/element/:wdc:1501054740532/attribute/href {"sessionId": "71437af0-71d5-11e7-83de-29eb7aea97e9", "name": "href", "id": ":wdc:1501054740532"}
- 2017-07-26 09:39:00 [selenium.webdriver.remote.remote_connection] DEBUG: Finished Request
- 2017-07-26 09:39:00 [selenium.webdriver.remote.remote_connection] DEBUG: GET http://127.0.0.1:52986/wd/hub/session/71437af0-71d5-11e7-83de-29eb7aea97e9/element/:wdc:1501054740533/attribute/href {"sessionId": "71437af0-71d5-11e7-83de-29eb7aea97e9", "name": "href", "id": ":wdc:1501054740533"}
- 2017-07-26 09:39:00 [selenium.webdriver.remote.remote_connection] DEBUG: Finished Request
- 2017-07-26 09:39:00 [selenium.webdriver.remote.remote_connection] DEBUG: GET http://127.0.0.1:52986/wd/hub/session/71437af0-71d5-11e7-83de-29eb7aea97e9/element/:wdc:1501054740534/attribute/href {"sessionId": "71437af0-71d5-11e7-83de-29eb7aea97e9", "name": "href", "id": ":wdc:1501054740534"}
- 2017-07-26 09:39:00 [selenium.webdriver.remote.remote_connection] DEBUG: Finished Request
- 2017-07-26 09:39:00 [selenium.webdriver.remote.remote_connection] DEBUG: POST http://127.0.0.1:52986/wd/hub/session/71437af0-71d5-11e7-83de-29eb7aea97e9/elements {"using": "xpath", "sessionId": "71437af0-71d5-11e7-83de-29eb7aea97e9", "value": "//a[@rel='next']"}
- 2017-07-26 09:39:00 [selenium.webdriver.remote.remote_connection] DEBUG: Finished Request
- 2017-07-26 09:39:00 [selenium.webdriver.remote.remote_connection] DEBUG: POST http://127.0.0.1:52986/wd/hub/session/71437af0-71d5-11e7-83de-29eb7aea97e9/element {"using": "xpath", "sessionId": "71437af0-71d5-11e7-83de-29eb7aea97e9", "value": "//a[@rel='next']"}
- 2017-07-26 09:39:00 [selenium.webdriver.remote.remote_connection] DEBUG: Finished Request
- 2017-07-26 09:39:00 [selenium.webdriver.remote.remote_connection] DEBUG: GET http://127.0.0.1:52986/wd/hub/session/71437af0-71d5-11e7-83de-29eb7aea97e9/element/:wdc:1501054740535/attribute/href {"sessionId": "71437af0-71d5-11e7-83de-29eb7aea97e9", "name": "href", "id": ":wdc:1501054740535"}
- 2017-07-26 09:39:00 [selenium.webdriver.remote.remote_connection] DEBUG: Finished Request
- 2017-07-26 09:39:02 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.pcworld.com/article/2146081/healthcare-gov-users-required-to-change-passwords-heartbleed.html> (referer: http://www.pcworld.com/search?query=heartbleed)
- 2017-07-26 09:39:02 [selenium.webdriver.remote.remote_connection] DEBUG: POST http://127.0.0.1:52986/wd/hub/session/71437af0-71d5-11e7-83de-29eb7aea97e9/url {"url": "http://www.pcworld.com/article/2146081/healthcare-gov-users-required-to-change-passwords-heartbleed.html", "sessionId": "71437af0-71d5-11e7-83de-29eb7aea97e9"}
- 2017-07-26 09:39:33 [selenium.webdriver.remote.remote_connection] DEBUG: Finished Request
- 2017-07-26 09:39:33 [selenium.webdriver.remote.remote_connection] DEBUG: POST http://127.0.0.1:52986/wd/hub/session/71437af0-71d5-11e7-83de-29eb7aea97e9/element {"using": "xpath", "sessionId": "71437af0-71d5-11e7-83de-29eb7aea97e9", "value": "//h1[@itemprop='headline']"}
- 2017-07-26 09:39:33 [selenium.webdriver.remote.remote_connection] DEBUG: Finished Request
- 2017-07-26 09:39:33 [selenium.webdriver.remote.remote_connection] DEBUG: GET http://127.0.0.1:52986/wd/hub/session/71437af0-71d5-11e7-83de-29eb7aea97e9/element/:wdc:1501054773836/attribute/innerHTML {"sessionId": "71437af0-71d5-11e7-83de-29eb7aea97e9", "name": "innerHTML", "id": ":wdc:1501054773836"}
- 2017-07-26 09:39:33 [selenium.webdriver.remote.remote_connection] DEBUG: Finished Request
- 2017-07-26 09:39:33 [selenium.webdriver.remote.remote_connection] DEBUG: POST http://127.0.0.1:52986/wd/hub/session/71437af0-71d5-11e7-83de-29eb7aea97e9/element {"using": "xpath", "sessionId": "71437af0-71d5-11e7-83de-29eb7aea97e9", "value": "//meta[@name='date']"}
- 2017-07-26 09:39:33 [selenium.webdriver.remote.remote_connection] DEBUG: Finished Request
- 2017-07-26 09:39:33 [selenium.webdriver.remote.remote_connection] DEBUG: GET http://127.0.0.1:52986/wd/hub/session/71437af0-71d5-11e7-83de-29eb7aea97e9/element/:wdc:1501054773837/attribute/content {"sessionId": "71437af0-71d5-11e7-83de-29eb7aea97e9", "name": "content", "id": ":wdc:1501054773837"}
- 2017-07-26 09:39:33 [selenium.webdriver.remote.remote_connection] DEBUG: Finished Request
- 2017-07-26 09:39:33 [selenium.webdriver.remote.remote_connection] DEBUG: POST http://127.0.0.1:52986/wd/hub/session/71437af0-71d5-11e7-83de-29eb7aea97e9/elements {"using": "xpath", "sessionId": "71437af0-71d5-11e7-83de-29eb7aea97e9", "value": "//div[contains(@itemprop, 'articleBody')]//p"}
- 2017-07-26 09:39:33 [selenium.webdriver.remote.remote_connection] DEBUG: Finished Request
- 2017-07-26 09:39:33 [selenium.webdriver.remote.remote_connection] DEBUG: GET http://127.0.0.1:52986/wd/hub/session/71437af0-71d5-11e7-83de-29eb7aea97e9/element/:wdc:1501054773838/attribute/innerHTML {"sessionId": "71437af0-71d5-11e7-83de-29eb7aea97e9", "name": "innerHTML", "id": ":wdc:1501054773838"}
- 2017-07-26 09:39:33 [selenium.webdriver.remote.remote_connection] DEBUG: Finished Request
- 2017-07-26 09:39:33 [selenium.webdriver.remote.remote_connection] DEBUG: GET http://127.0.0.1:52986/wd/hub/session/71437af0-71d5-11e7-83de-29eb7aea97e9/element/:wdc:1501054773839/attribute/innerHTML {"sessionId": "71437af0-71d5-11e7-83de-29eb7aea97e9", "name": "innerHTML", "id": ":wdc:1501054773839"}
- 2017-07-26 09:39:33 [selenium.webdriver.remote.remote_connection] DEBUG: Finished Request
- 2017-07-26 09:39:33 [selenium.webdriver.remote.remote_connection] DEBUG: GET http://127.0.0.1:52986/wd/hub/session/71437af0-71d5-11e7-83de-29eb7aea97e9/element/:wdc:1501054773840/attribute/innerHTML {"sessionId": "71437af0-71d5-11e7-83de-29eb7aea97e9", "name": "innerHTML", "id": ":wdc:1501054773840"}
- 2017-07-26 09:39:33 [selenium.webdriver.remote.remote_connection] DEBUG: Finished Request
- 2017-07-26 09:39:33 [selenium.webdriver.remote.remote_connection] DEBUG: GET http://127.0.0.1:52986/wd/hub/session/71437af0-71d5-11e7-83de-29eb7aea97e9/element/:wdc:1501054773841/attribute/innerHTML {"sessionId": "71437af0-71d5-11e7-83de-29eb7aea97e9", "name": "innerHTML", "id": ":wdc:1501054773841"}
- 2017-07-26 09:39:33 [selenium.webdriver.remote.remote_connection] DEBUG: Finished Request
- 2017-07-26 09:39:33 [selenium.webdriver.remote.remote_connection] DEBUG: GET http://127.0.0.1:52986/wd/hub/session/71437af0-71d5-11e7-83de-29eb7aea97e9/element/:wdc:1501054773842/attribute/innerHTML {"sessionId": "71437af0-71d5-11e7-83de-29eb7aea97e9", "name": "innerHTML", "id": ":wdc:1501054773842"}
- 2017-07-26 09:39:33 [selenium.webdriver.remote.remote_connection] DEBUG: Finished Request
- 2017-07-26 09:39:33 [selenium.webdriver.remote.remote_connection] DEBUG: GET http://127.0.0.1:52986/wd/hub/session/71437af0-71d5-11e7-83de-29eb7aea97e9/element/:wdc:1501054773843/attribute/innerHTML {"sessionId": "71437af0-71d5-11e7-83de-29eb7aea97e9", "name": "innerHTML", "id": ":wdc:1501054773843"}
- 2017-07-26 09:39:33 [selenium.webdriver.remote.remote_connection] DEBUG: Finished Request
- 2017-07-26 09:39:33 [scrapy.core.scraper] DEBUG: Scraped from <200 http://www.pcworld.com/article/2146081/healthcare-gov-users-required-to-change-passwords-heartbleed.html>
- {'Article': u'If you have an account with HealthCare.gov, you can expect to change your password the next time you log in. And you can thank Heartbleed for it. According to the website, all HeathCare.gov users will be prompted to change their passwords the next time they log into the site. According to the site,\xa0"HealthCare.gov uses many layers of protections to secure your information," and theres no sign that any Healthcare.gov user information has been compromised, so this is mainly a precautionary measure. The Associated Press notes that the US Government is reviewing al of its sites to see if theyre vulnerable to the Heartbleed bug, so its possible that users of other government sites may have to change their passwords in the not-too-distant future. HealthCare.gov recommends using a password thats unique to your Healthcare.gov account. Some password managers, such as 1Password,\xa0can generate and store unique passwords that you dont need to memorize. But you dont need a password manager to devise stronger passwords: There are some tricks you can employ to create strong passwords that you can actually remember. See Alex Wawros guide to creating stronger passwords without losing your mind\xa0for one approach. And visit HealthCare.gov for more on that sites mandatory password change requirement. This story, "Blame Heartbleed: HealthCare.gov requires users to change their passwords" was originally published by TechHive.',
- 'Datum': u'2014-04-19',
- 'Original_URL': 'http://www.pcworld.com/article/2146081/healthcare-gov-users-required-to-change-passwords-heartbleed.html',
- 'Ueberschrift': 'Blame Heartbleed: HealthCare.gov requires users to change their passwords'}
- 2017-07-26 09:39:34 [scrapy.core.engine] DEBUG: Crawled (200) <GET http://www.pcworld.com/search?query=heartbleed&start=10> (referer: http://www.pcworld.com/search?query=heartbleed)
- 2017-07-26 09:39:35 [selenium.webdriver.remote.remote_connection] DEBUG: POST http://127.0.0.1:52986/wd/hub/session/71437af0-71d5-11e7-83de-29eb7aea97e9/url {"url": "http://www.pcworld.com/search?query=heartbleed&start=10", "sessionId": "71437af0-71d5-11e7-83de-29eb7aea97e9"}