Advertisement
Not a member of Pastebin yet?
Sign Up,
it unlocks many cool features!
- puts "AngryAngry Spider v0.1"
- puts "Url ripping started"
- inital_url = "www.google.com"
- port = 80
- start_page="/index.html"
- stripped_links[]
- request = "GET #{start_page} HTTP/1.0\r\n\r\n"
- spider = TCPSocket.open(host, port)
- spider.print(request)
- initial_page = spider.read
- headers,body = initial_page.split("\r\n\r\n", 2)
- i = 0
- for webpage.scan(/http:\/\/\w+/).each |link|
- stripped_links[i] = link
- puts link
- i = i + 1
- end
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement