mechnicov

Untitled

Jul 30th, 2020 (edited)
27
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
text 0.74 KB | None | 0 0
  1. HOW_MANY_PAGES = 10
  2. URL_PATTERN = 'https://rostov.hh.ru/search/vacancy?L_is_autosearch=false&area=113&clusters=true&enable_snippets=true&order_by=publication_time&schedule=remote&page=%d'
  3.  
  4. def scrape
  5. (1..HOW_MANY_PAGES).flat_map do |i|
  6. url = URL_PATTERN % i
  7.  
  8. puts "Parsing page #{i}: #{url}"
  9. page = Nokogiri::HTML(URI.open(url, &:read))
  10.  
  11. page.css('div.vacancy-serp-item').map do |element|
  12. {
  13. position: element.css('span.g-user-content').text,
  14. salary: element.css('div.vacancy-serp-item__sidebar').text,
  15. company: element.css('div.vacancy-serp-item__meta-info').text,
  16. work_desc: element.css('div.g-user-content').text,
  17. link: element.at_css('a')['href']
  18. }
  19. end
  20. end
  21. end
  22.  
Advertisement
Add Comment
Please, Sign In to add comment