Advertisement
Not a member of Pastebin yet?
Sign Up,
it unlocks many cool features!
- 'http://www.basketball-reference.com/leagues/NBA_2014.html' -> url
- '#team' -> css_page
- url %>>%
- html %>>%
- html_nodes(css_page) %>>%
- html_table(header = F) %>>%
- data.frame() %>>%
- tbl_df() -> total_table
- total_table %>>%
- filter(X.1 == 'Rk') %>>% as.character -> names
- 'Rk' %>>% grep(x = total_table$X.1) -> row_of_header #ищем ранг
- names %>>% tolower -> names(total_table)
- names(total_table) %>>% (gsub('\\%|/','\\.',.)) -> names(total_table)
- (row_of_header + 1) %>>% (total_table[.:nrow(total_table),]) -> total_table #пропускаем этот ряд и со следующего идем до конца
- total_table %>>% head
- #This paste is a code from tutorial by Alex Bresler
- # http://asbcllc.com/blog/2014/november/creating_bref_scraper/
- # Comments translated into Russian for
- # http://www.datadrivenjournalism.ru/2015/03/webscrape-in-r/
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement