SteveWeston

Sequential data table benchmark

Sep 13th, 2013
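library(data.table)

# Build the test data: n values spread at random over m group ids.
set.seed(107)
n <- 1000000
m <- 10000
td <- data.frame(val=rnorm(n), id=sample(m, n, replace=TRUE))

# Time the conversion to a data.table plus the grouped computation.
start <- proc.time()[3]
DT <- data.table(td)
# Within each id group, .I holds that group's row numbers, so DT[-.I, val]
# selects val from every row outside the group. Each row therefore gets the
# mean of all values that do not share its id.
DT[, means := mean(DT[-.I, val]), by = id]
elapsed <- proc.time()[3] - start

# Report the elapsed time and an MD5 hash of the result so that other
# versions of the benchmark can be compared against this one.
library(digest)
res <- DT$means
cat("Sequential data table version:\n")
cat(sprintf("Elapsed time: %f, MD5 hash: %s\n", elapsed, digest(res)))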
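
The grouped step above assigns to each row the mean of every val outside that row's id group. As a rough cross-check, and only a sketch under stated assumptions, the base-R snippet below computes that quantity two ways on a much smaller data set: directly, and from sums as (total sum - group sum) / (n - group size). The sizes n = 1000 and m = 10 and the names direct and viasums are made up for illustration and are not part of the original benchmark.

```r
# Hypothetical small sizes so the direct version finishes quickly.
set.seed(107)
n <- 1000
m <- 10
val <- rnorm(n)
id <- sample(m, n, replace = TRUE)

# Direct version: for each row, average every value outside its id group.
direct <- vapply(seq_len(n), function(i) mean(val[id != id[i]]), numeric(1))

# Sum-based version: ave() repeats each group's sum and size on every row
# of that group, so the whole computation is vectorised.
grp_sum <- ave(val, id, FUN = sum)
grp_n   <- ave(val, id, FUN = length)
viasums <- (sum(val) - grp_sum) / (n - grp_n)

# Compare with a tolerance; the two need not be bit-identical.
print(all.equal(direct, viasums))
```

Because the two versions add the numbers in a different order, their results can differ in the last few bits, so an MD5 hash of the sum-based result should not be expected to match the hash printed by the benchmark.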