Not a member of Pastebin yet?
Sign Up,
it unlocks many cool features!
- ################################################################################################################
- # ADDITIONAL LIBRARIES TO INSTALL IN UBUNTU
- #
- # for devtools: sudo apt-get install curl && sudo apt-get install libcurl4-gnutls-dev
- # for RMySQL: sudo apt-get install libmysqlclient-dev
- # for openssl: sudo apt-get install libssl-dev (this is needed also for many subsequent packages)
- # for rgdal: sudo aptitude install libproj-dev
- # for rgeos: sudo aptitude install libgdal-dev
- # for geojsonio: sudo apt-get install libv8-3.14-dev (must be installed AFTER rgdal & rgeos)
- # for XML: sudo apt-get install libxml2-dev
- # for rgl: sudo apt-get build-dep r-cran-rgl
- # for EBImage: sudo apt-get install libfftw3-dev
- # for rJava: sudo apt-get install openjdk-7-* && sudo R CMD javareconf
- # for diagrammeR: sudo apt-get install librsvg2-dev
- ################################################################################################################
- ### DATA I/O
- - for csv/tsv/txt files no package is required. Just use the core functions: read.csv, read.delim, read.table, read.fwf; or even the *scan* function
- - [readr](http://github.com/hadley/readr '') -
- - [rio](http://cran.r-project.org/web/packages/rio/ '') - Pull a lot of separate data-reading packages into one, so you just need to remember 2 functions: import and export.
- - [readxl](http:// '') - Excel files
- - [xlsx]( '') - Excel files
- - [XLConnect]( '') - Excel files
- - [openxlsx](http://github.com/awalker89/openxlsx '') - Excel files
- - [XML](http://cran.r-project.org/web/packages/XML/ '') - Read and create XML documents with R
- - [jsonlite](http://www.opencpu.org/posts/jsonlite-a-smarter-json-encoder/ '') - Create, read and write JSON data tables
- - [rjson]( '') -
- - [downloader](http://github.com/wch/downloader '') - Wrapper for base R download function that eases dealing with files over https
- - [googlesheets](http://github.com/jennybc/googlesheets '') - Easily read data into R from Google Sheets
- - [foreign](http:// '') - other (mostly proprietary) stats programs
- - [RMySQL](http://cran.r-project.org/web/packages/RMySQL/ '') - connector to a MySQL database
- - [RPostgresSQL](http:// '') - connector to a PostgreSQL database
- - [RODBC](http:// '') - connector to a database who does not have a standalone driver (like MSSQL, see [here]() for explanations on a linux box)
- - [rmongodb](http:// '') - connector to a mongoDB database
- - [RSQLite](http:// '') - connector to a SQLite database
- - [sqldf](http://github.com/ggrothendieck/sqldf '') - Run SQL queries on any data frame with [sqldf syntax](http://www.sqlite.org/lang.html '')
- ### DATA MANIPULATION (acting on atomics)
- - [stringr](http://github.com/hadley/stringr '') - Easy to learn tools for text manipulation, regular expressions included. Most functions are prefixed with `str_` so they are very easy to remember.
- - [lubridate](http://github.com/hadley/lubridate '') - Tools that make working with dates and times easier.
- - [qdapRegex](http:// '') - Collection of reg-expr tools built in the context of discourse analysis (see qdap package), though often useful outside of it
- ### DATA WRANGLING (including EDA) (acting on structures)
- - [data.table](http://rawgit.com/wiki/Rdatatable/data.table/vignettes/datatable-intro-vignette.html '') - An alternative way to organize data sets for very, very fast operations. Useful when dealing with large data sets.
- - [dplyr](http://cran.rstudio.com/web/packages/dplyr/ '') - Essential shortcuts for subsetting, summarizing, rearranging, and joining together data sets. dplyr is our go to package for fast data manipulation.
- - [plyr](http://plyr.had.co.nz/ '') - An easy to use substitute for *split-apply-combine* functionality in Base R: *split* a data structure into groups, *apply* a function on each group, *combine* and return the results in a possibly different data structure. Even if a bit outdated, still great when dealing with lists
- - [tidyr](http://github.com/hadley/tidyr '') - Tools for changing the layout of any dataframe. Use the gather and spread functions to convert your data into the tidy format, the layout R likes best.
- - [reshape2](http://github.com/hadley/reshape '') - Change data row and column formats from "wide" to "long"; turn variables into column names or column names into variables and more. Look at the tidyr package for a newer, more focused option
- - [purrr](https://github.com/hadley/purrr) - Completes R's functional programming tools with important features from other languages
- - [scales](http://cran.r-projects.com/web/packages/scales/ '') -
- - [xda](http:// '') - contains several tools to perform initial exploratory analysis on any input dataset.
- - [funModeling](http://github.com/pablo14/funModeling ) -
- - [gmodels](http://cran.r-project.org/web/packages/gmodels/) -
- - [rowr](https://cran.r-project.org/web/packages/rowr/) - Allows the manipulation of R objects as if they were organized rows in a way that is familiar to people used to working with databases.
- ### DATA IMPUTATION, OUTLIERS, ANOMALY DETECTION
- - [mice](http:// '')
- - [Amelia](http:// '')
- - [missForest](http:// '')
- - [Hmisc](http:// '')
- - [mi](http:// '')
- - [VIM](http:// '')
- - [rrcovNA](http:// '')
- - [mvnmle](http:// '')
- - [missMDA](http:// '')
- - [norm](http:// '')
- - [softimpute](http:// '')
- ### DATA DISPLAY
- - [xtable](http:// '') - The xtable function takes an R object (like a data frame) and returns the latex or HTML code you need to paste a pretty version of the object into your documents. Copy and paste, or pair up with R Markdown.
- - [DT](http://rstudio.github.io/DT/ '') - R interface to the open-source JS library [DataTables](https://datatables.net/)
- - [formattable](http://renkun.me/formattable/ '') -
- - [rhandsontable](http://jrowen.github.io/rhandsontable/ '') -
- - [sparktable]( '') -
- - [rpivotTable](http://github.com/smartinsightsfromdata/rpivotTable '') -
- - [D3TableFilter](http:// '')
- - [listviewer](http:// '')
- ### DATA VISUALIZATION
- - [ggplot2](http://docs.ggplot2.org/current/ '') - The most famous package for making beautiful graphics in R. ggplot2 lets you use the [grammar of graphics](http://vita.had.co.nz/papers/layered-grammar.pdf 'A paper by Hadley Wickham') to build layered, customizable plots
- - [ggExtra](http://github.com/daattali/ggExtra '') - Add marginal plots to ggplot2
- - [ggfortify](http://cran.r-project.org/web/packages/ggfortify/ '') - Data Visualization Tools for Statistical Analysis Results in a unified style using 'ggplot2'
- - [ggiraph](http://davidgohel.github.io/ggiraph/introduction.html) - It extends ggplot2 with new interactive geom functions
- - [ggvis](http:// '') - Interactive, web based graphics built like ggplot2 with the grammar of graphics (someone said that ggvis would have been the new version of ggplot2, but Hadley just released ggplot2 2 last december)
- - [rCharts](http://rcharts.io/ '') -
- - [dygraphs](http://rstudio.github.io/dygraphs/ '') - Wrapper for the JS library graphs of time series, one-line command if your data is an xts object.
- - [plotly](http://plot.ly/r/ '') - R interface to the open-source JS library [Plotly](http://plot.ly/javascript/). It includes a ggplotly() function to convert on the fly graphs created with ggplot2
- - [highcharter](http:// '') - R interface to the open-source JS library [Highcharts](http://www.highcharts.com/)
- - [rbokeh](http://hafen.github.io/rbokeh/ '') - R interface to the open-source JS library [Bokeh](http://bokeh.pydata.org/)
- - [d3heatmap](https://github.com/rstudio/d3heatmap '') -
- - [metricsgraphics](http://hrbrmstr.github.io/metricsgraphics/ '') - R interface to the JS library [metricsgraphics](http://metricsgraphicsjs.org/) for bare-bones line, scatterplot and bar charts
- - [scatterD3](http://github.com/juba/scatterD3 '') - Interactive scatter plots visualization
- - [rcdimple](http://github.com/timelyportfolio/rcdimple '') - R interface to the JS library [dimple]()
- - [timevis](http://github.com/daattali/timevis '') - Interactive timeline visualizations in R.
- - [wordcloud2](http://github.com/lchiffon/wordcloud2 ) - R interface to the JS library [wordcloud2](https://github.com/timdream/wordcloud2.js)
- - [animation]( '') -
- - [misc3d]() - Miscellaneous functions for three dimensional plots
- - [threejs](http://github.com/bwlewis/rthreejs '') - Interactive 3D scatterplots and 3D globes.
- - [rgl](http:// '') - Interactive 3D visualizations with R
- - [animint](http://github.com/tdhock/animint '')
- - [googleVis](http://github.com/mages/googleVis '') - wrapper for the Google Chart API
- - [hexbin]( '') - provides bivariate binning into hexagonal cells, in an attempt to override overlapping in scatterplots
- - [RColorBrewer](http://cran.r-project.org/web/packages/RColorBrewer/index.html '') - RColorBrewer helps you select adequate color palettes for any visualization. See also the original [ColorBrewer](http://colorbrewer2.org/ '') website
- - [Rgraphviz]( '') - provides plotting capabilities for R graph objects
- ### DATA MODELING
- - [stats](http://stat.ethz.ch/R-manual/R-devel/library/stats/html/00Index.html) - Contains functions for statistical calculations and random number generation
- - [broom](http://cran.r-project.org/web/packages/broom/ '') - Convert Statistical Analysis Objects into Tidy Data Frames
- - [car](http:// '') - package linked to the book **An R Companion to Applied Regression**
- - [rms](http://biostat.mc.vanderbilt.edu/wiki/Main/RmS '') -
- - [mgcv](http:// '') - Generalized Additive Models
- - [lme4/nlme](http:// '') - Linear and Non-linear mixed effects models
- - [multcomp](http:// '') - Tools for multiple comparison testing
- - [vcd](http:// '') - Visualization tools and tests for categorical data
- - [glmnet](http://cran.r-project.org/web/packages/glmnet/ '') - Lasso and elastic-net regularized GLM with cross validation
- - [lars](https://cran.r-project.org/web/packages/lars/ '') -
- - [survival](http:// '') - Tools for survival analysis
- - [FFTrees]() - Fast and frugal trees make very fast decisions based on a few pieces of information and ignore all others
- - [dismo](http:// '') - Boosted Regression Trees for ecological modeling
- ### DATA MINING, MACHINE LEARNING, NEURAL NETWORK, TEXT ANALYTICS, IMAGE RECOGNITION
- - [mlr](http://github.com/mlr-org/mlr '') - Interface to a large number of classification and regression techniques. [Here](http://mlr-org.github.io/mlr-tutorial/release/html/) for a tutorial
- - [class](http://cran.r-project.org/web/packages/class/ '') - Various functions for classification, including KNN, LVQ and SOM.
- - [caret](http://cran.r-project.org/web/packages/caret/ '') - Tools for Classification And REgression Training models, with the intent to combine model training and prediction. A set of functions that attempt to streamline the process for creating predictive models
- - [klaR](http://cran.r-project.org/web/packages/klaR '') - Miscellaneous functions for classification and visualization
- - [ROCR](http://cran.r-project.org/web/packages/ROCR/) - Visualizing the performance of scoring classifiers
- - [pROC](http://cran.r-project.org/web/packages/pROC/ '') - Display and Analyze ROC Curves
- - [randomForest](http://cran.r-project.org/web/packages/randomForest/ '') - Classification methods used to create large number of decision trees, then each observation is inputted into the decision tree. The common output obtained for maximum of the observations is considered as the final output.
- - [e1071](http://cran.r-project.org/web/packages/e1071/ '') - Latent class analysis, support vector machine, fuzzy clustering, Fourier transforms, shortest path computation, bagged clustering, naive Bayes classifier, ...
- - [tree](http://cran.r-project.org/web/packages/tree/ '') - Classification and regression trees.
- - [rpart](http://cran.r-project.org/web/packages/rpart/ '') - Recursive Partitioning And Regression Trees: classification/regression models using a two stage procedure, with the resultant model represented in the form of binary trees
- - [party](http://cran.r-project.org/web/packages/party/ '') - recursive partitioning, using ensemble methods, to build decision trees based on Conditional Inference algorithm
- - [partykit](http://cran.r-project.org/web/packages/partykit/ '') - A Toolkit for Recursive Partytioning
- - [arules](http://cran.r-project.org/web/packages/arules/ '') - Mining Association Rules and Frequent Itemsets
- - [nnet](http://cran.r-project.org/web/packages/nnet/ '') - Feed-forward Neural Networks and Multinomial Log-Linear Models
- - [neuralnet](http://cran.r-project.org/web/packages/neuralnet/ '') - Training of neural networks using back-propagation
- - [kknn](https://cran.r-project.org/web/packages/kknn/ '') - Weighted k-Nearest Neighbors for Classification, Regression and Clustering
- - [kernlab](http://cran.r-project.org/web/packages/kernlab/ '') - KERNel-based Machine Learning LABoratory
- - [C50](http:// '') -
- - [gbm](http://cran.r-project.org/web/packages/gbm/ '') - Gradient Boosting Machine
- - [AppliedPredictiveModeling](http:// '') -
- - [earth](http:// '') -
- - [mda](http:// '') -
- - [tau](http://cran.r-project.org/web/packages/tau/ '') - Text Analysis Utilities
- - [ada](http://cran.r-project.org/web/packages/ada/) - Stochastic Boosting
- - [adabag](http://www.jstatsoft.org/index.php/jss/article/view/v054i02/adabag_An_R_Package_for_Classification_with_Boosting_and_Bagging.pdf) - Classification with Boosting and Bagging
- - [RoogleVision](http:// '') - a Package for Image Recognition
- ### DATA REPORTING
- - [shiny](http://shiny.rstudio.com/, 'shiny ') - Easily make interactive, web apps with R. A perfect way to explore data and share findings with non-programmers.
- - [shinyDashboard](http://rstudio.github.io/shinydashboard/ '') - Makes it easy to use Shiny to create dashboards-like apps.
- - [shinythemes](http://rstudio.github.io/shinythemes/ '') - Makes it easy to alter the overall appearance of any Shiny app
- - [shinyjs](http://github.com/daattali/shinyjs '') - It lets you perform common useful JS operations in Shiny apps without having to actually know any JS
- - [shinyBS](http://ebailey78.github.io/shinyBS/index.html '') - Add additional functionality and interactivity to Shiny apps, like Alerts, Tooltips and Popovers.
- - [rmarkdown](http://rmarkdown.rstudio.com/, 'rmarkdown') - The perfect workflow for reproducible reporting. Write R code in your markdown reports. When you run render, R Markdown will replace the code with its results and then export your report as an HTML, pdf, or MS Word document, or a HTML or pdf slideshow. The result? Automated reporting. R Markdown is integrated straight into RStudio.
- - [flexdashboard](http://rmarkdown.rstudio.com/flexdashboard/ '') - A flexible and easy way to specify row and column-based layouts, to publish a group of related data visualizations as a dashboard.
- - [bookdown](http://bookdown.org/yihui/bookdown/ ) - This is not what many people would a *fundamental* package, but if you are in the mood to author a book
- ### WEB, DIGITAL ANALYTICS, SOCIAL MEDIA MINING
- - [RCurl]( '') - Composition of general HTTP requests, functions to fetch URLs, to get and post web data
- - [httr](http:// '') - A set of useful tools for working with http connections
- - [rvest](http://cran.r-project.org/web/packages/rvest/vignettes/selectorgadget.html '') Web scraping: Extract data from HTML pages. Works well with Selectorgadget.
- - [RGoogleAnalytics](http://www.tatvic.com/blog/google-analytics-data-extraction-in-r/ ) - A bridge to **Google Analytics**
- - [GAR](http://github.com/andrewgeisler/GAR ) - Another interface to **Google Analytics**
- - [rga](http://github.com/skardhamar/rga/ '') - One more interface to **Google Analytics**
- - [RSiteCatalyst](http://randyzwitch.com/rsitecatalyst/ '') - An interface to **Adobe Analytics**
- ### MATRIX CALCULUS
- - [Matrix](http://cran.r-project.org/web/packages/Matrix/ '') - Sparse and Dense Matrix Classes and Methods
- - [svd](https://cran.r-project.org/web/packages/svd/ '') - Interface to Lanczos SVD and eigensolvers from R
- - [irlba](http://illposed.net/irlba.pdf '') - Provides a fast way to compute partial SVDs and principal component analyses of very large scale data
- ### TIME SERIES, FORECASTING, FINANCIAL DATA, QUANT
- - [zoo](http://cran.r-project.org/web/packages/zoo/) - Provides the most popular format for saving and handling with time series objects in R.
- - [xts](http://) - Very flexible tools for manipulating time series data sets.
- - [forecast]() - forecast makes it incredibly easy to fit time series models like ARIMA, ARMA, AR, Exponential Smoothing, etc.
- - [quantmod](http://www.quantmod.com/examples/intro/) - Tools for downloading financial data, plotting common charts, and doing technical analysis.
- - [TTR](http://cran.r-project.org/web/packages/TTR/) - Fast and Flexible Technical Analysis
- - [qundl](http://www.quandl.com/tools/r/ '') -
- - [fMultivar](http://cran.r-project.org/web/packages/fMultivar/fMultivar.pdf '') - Analysis and Modeling of Multivariate Financial Return Distributions
- ### GRAPHS, NETWORKS
- - [RNeo4j](http://github.com/nicolewhite/RNeo4j) -
- - [igraph](http://igraph.org/r/) - A collection of network analysis tools
- - [visNetwork](http://datastorm-open.github.io/visNetwork/) - R interface to the open-source JS library [vis.js](http://visjs.org/)
- - [network3D](http://christophergandrud.github.io/networkD3/) - network graphs
- - [DiagrammeR](http://rich-iannone.github.io/DiagrammeR/) - R interface to the open-source JS libraries [mermaid.js](http://github.com/knsv/mermaid) and [Graphviz](http://www.graphviz.org/), capable of generating diagrams and flowcharts from text in a similar manner as markdown.
- ### SPATIAL DATA (the emphasis here is on economic and social applications)
- #### GENERIC
- - [sp](http://cran.rstudio.com/web/packages/sp/index.html '') - Provides classes and methods for spatial data
- - [rgdal](http:// '') - R’s interface to the popular C/C++ *Geospatial Abstraction Library* [GDAL](http://www.gdal.org/ ''), that enables R to handle a broader range of spatial data formats.
- - [rgeos](http:// '') - Tools for handling spatial operations on topologies. R’s interface to the powerful vector processing library [geos](http://trac.osgeo.org/geos/ '')
- - [PSBmapping]( '') -
- #### MAPPING
- - [maptools](http:// '') - provides various functions for manipulating and reading spatial data
- - [maps](http:// '') - Easy to use map polygons for plots
- - [mapview](https://github.com/environmentalinformatics-marburg/mapview '') - Interactive visualization of spatial objects in R
- - [tmap](https://github.com/mtennekes/tmap '') - Quick and easy thematic mapping in R
- - [ggmap](http:// '') - extends the plotting package ggplot2 for maps. It enables downloading street maps from Google/OS maps, and use them as a background.
- - [leaflet](http://rstudio.github.io/leaflet/ '') - Interactive mapping tools as a wrapper for JS leaflet, from htmlwidgets
- - [GISTools](http://cran.r-project.org/web/packages/GISTools/GISTools.pdf '') - Mapping and spatial data manpulation tools
- - [OpenStreetMap]( '') - Access high res raster maps and satellite imagery to use as a background
- - [RgoogleMaps]( '') - Easily maps any data onto Google Map tiles
- - [googleVis]( '') - Generic package for data viz that contains some functions specifically targeted for mapping
- - [choroplethr](http:// '') - mapping tool
- - [RWorldMap]( '') - lets map easily global data
- - [raster]( '') - Functions for I/O, manipulating and modeling of gridded rasters or spatial data
- - [rasterVis]( '') - raster visualization
- =#### ANALYSIS, GEOCOMPUTATION
- - [gstat](http://cran.rstudio.com/web/packages/gstat/index.html '') - Functions for spatital and spatio-temporal geostatistical modeling, prediction and simulation
- - [geoR]( '') - Geostatistical analysis
- - [GeoXp]( '') - Interactive exploratory spatial data analysis
- - [spatstat]( '') - Spatial Point Pattern analysis
- - [spdep]( '') - A collection of functions and tests for evaluating spatial dependence
- - [GWmodel]() - Geographically weighted models
- - [spatgraphs]( '') - Graph Edge Computations for Spatial Point Patterns
- - [spacetime](http://cran.rstudio.com/web/packages/spacetime/index.html '') -
- - [trajectories](http://cran.rstudio.com/web/packages/trajectories/index.html '') -
- - [akima]( '') - for spline interpolation
- - [deldir](http://cran.r-project.org/web/packages/deldir/deldir.pdf '') - Functions to calculate and manipulate Delaunay Triangulations and Dirichlet or Voronoi tessellations of points datasets
- ### PROBABILITY, SIMULATION, COMPUTATIONAL STATISTICS
- - [boot]( '') - bootstrap calculations
- - [deSolve]( '') -
- - [cvTools]( '') -
- - [laeken]( '') -
- - [rlecuyer]( '') -
- ### HIGH PERFORMANCE
- - [Rcpp](http:// '') - Write R functions that call C++ code for lightning fast speed.
- - [parallel](http:// '') - Use parallel processing in R to speed up your code or to crunch large data sets.
- - [foreach](http://cran.r-project.org/web/packages/foreach/ '') -
- - [doMC](http://cran.r-project.org/web/packages/doMC/ '') - Multicore processing
- - [multicore]( '') -
- - [doSNOW ]( '') -
- - [SOAR](http://cran.r-project.org/web/packages/SOAR/ '') - Memory management by delayed assignments
- - [rbenchmark]( '') -
- ### CLOUD
- - [bigrquery](https://github.com/rstats-db/bigrquery '') - provides an R interface to [Google BigQuery](https://developers.google.com/bigquery/ '')
- ### PACKAGE DEVELOPMENT
- - [devtools](http://github.com/hadley/devtools/ '') - An essential suite of tools for turning code into an R package. Rstudio has designed cheatsheet [here](http://www.rstudio.com/wp-content/uploads/2015/06/devtools-cheatsheet.pdf)
- - [testthat](http:// '') - An easy way to write unit tests for any code projects.
- - [roxygen2](http://roxygen.org/#documentation '') - A quick way to document any R packages. roxygen2 turns inline code comments into documentation pages and builds a package namespace.
- - [htmlwidgets](http://www.htmlwidgets.org/ '') - A fast way to build interactive (javascript based) displays and visualizations. See also the [gallery](http://gallery.htmlwidgets.org/)
- - [installr](http://github.com/talgalili/installr/ '') - (Windows only) Allows to update the installed version of R from within R
- ### BIOCONDUCTOR
- [Bioconductor](https://www.bioconductor.org/) provides tools for the analysis and comprehension of high-throughput genomic data. The `biocLite()` command is the recommended way to install *Bioconductor* packages: *Bioconductor* has a repository and release schedule that differs from *R*. Run `source('http://bioconductor.org/biocLite.R')` to get the latest version of *Bioconductor*.
- - [IRanges]( '') -
- - [AnnotationDbi]( '') -
- ### MISCELLANEA
- - [HSAUR](http://cran.r-project.org/web/packages/HSAUR/ '') - Functions, data sets, analyses and examples from the book *A Handbook of Statistical Analyses Using R*
- - [MASS](http://cran.r-project.org/web/packages/MASS) - Functions and datasets to support the book [Modern Applied Statistics with S](http://www.stats.ox.ac.uk/pub/MASS4/)
- - [swirl]( '')
- ### EXTERNAL RESOURCES
- - [font awesome](http://fontawesome.io/icons/ '')
- - [Glyphicons](http://getbootstrap.com/components/#glyphicons '')
Advertisement
Add Comment
Please, Sign In to add comment