Advertisement
Guest User

Untitled

a guest
Jan 7th, 2015
203
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
Ruby 0.63 KB | None | 0 0
  1. start = 0
  2.         keep_going = true
  3.  
  4.         while keep_going
  5.  
  6.             # use threads to contain memory consumption
  7.             thread = Thread.new do |t|
  8.  
  9.                 @search_result_page_num = start/10
  10.                 puts "google page #{start/10}"
  11.                 page = a.get("http://www.google.com/search?safe=off&start=#{start}&sa=N&q=#{search_str}")
  12.  
  13.                 page.links.each do |link|
  14.                     #do some stuff
  15.                 end
  16.  
  17.                 keep_going = false if page.body.include?('we have omitted some') || page.body.include?('did not match any documents')  
  18.                
  19.                 t.join
  20.                 Thread.exit
  21.             end
  22.  
  23.             # wait for thread
  24.             while thread.alive?
  25.                 sleep 1
  26.             end
  27.  
  28.             start += 10
  29.             sleep 5
  30.  
  31.         end
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement