Advertisement
ansakoy

Bresler4

Mar 6th, 2015
3,166
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
R 0.88 KB | None | 0 0
  1. 'http://www.basketball-reference.com/leagues/NBA_2014.html' -> url
  2. '#team' -> css_page
  3. url %>>%
  4.     html %>>%
  5.     html_nodes(css_page) %>>%
  6.     html_table(header = F) %>>%
  7.     data.frame() %>>%
  8.     tbl_df() -> total_table
  9. total_table %>>%
  10.     filter(X.1 == 'Rk') %>>% as.character -> names
  11. 'Rk' %>>% grep(x = total_table$X.1) -> row_of_header #ищем ранг
  12. names %>>% tolower -> names(total_table)
  13. names(total_table) %>>% (gsub('\\%|/','\\.',.)) -> names(total_table)
  14. (row_of_header + 1) %>>% (total_table[.:nrow(total_table),]) -> total_table #пропускаем этот ряд и со следующего идем до конца
  15. total_table %>>% head
  16.  
  17. #This paste is a code from tutorial by Alex Bresler
  18. # http://asbcllc.com/blog/2014/november/creating_bref_scraper/
  19. # Comments translated into Russian for
  20. # http://www.datadrivenjournalism.ru/2015/03/webscrape-in-r/
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement