One Page R: A Survival Guide to Data Science with R

*
Graham Williams
International Visiting Professor
Chinese Academy of Sciences
Shenzhen Institutes of Advanced Technology
*

Welcome to **OnePageR**. These chapters weave together a collection of tools
for the data scientist. The tools are all part of the R Statistical Software Suite.

Each *OnePageR* chapter is actually made up of multiple pages!
Each page within a chapter is a one page guide that covers a particular
aspect of the topic under review.

The *OnePageR*s can be worked through as a hands-on guide and
then used as a reference. Each page aims to be a bite-sized
chunk for hands-on learning, building on what has gone before. Many
chapters also have a lecture pack and a laboratory session in which a
number of tasks can be completed.

The R code sitting behind each OnePageR chapter is also provided and can be easily run standalone to replicate the material presented in the chapter.

The material is always **under development**! Chapters will
change (and hopefully improve) regularly, but links preceded by a *
are better developed. **Feedback, suggestions, and ideas are more
than welcome.**

Refer to the Data Mining Survival Guide or my book, *Data Mining with Rattle and R: The Art of Excavating Data for Knowledge Discovery* (Use R), for related material.

Many of the initial chapters were developed and tested whilst visiting the Shenzhen Institutes of Advanced Technology as an International Visiting Professor of the Chinese Academy of Sciences.

The data used across the chapters is available for download as data.zip.
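Once the archive has been downloaded, it can be unpacked from within R. The following is a minimal sketch, assuming `data.zip` has already been saved to the current working directory; the helper name `unpack_data` and the target folder `data` are illustrative choices, not names used by the chapters.

```r
# Hypothetical helper: unpack a downloaded archive and return the extracted paths.
# The defaults for `zipfile` and `exdir` are assumptions for illustration only.
unpack_data <- function(zipfile = "data.zip", exdir = "data") {
  if (!file.exists(zipfile)) {
    stop("Archive not found: ", zipfile)
  }
  # utils::unzip() returns the paths of the files it extracted.
  extracted <- unzip(zipfile, exdir = exdir)
  extracted
}

# Example call (commented out because it needs the downloaded archive):
# files <- unpack_data()
# print(files)
```

Returning the extracted paths makes it easy to see which datasets are available before reading any of them into R.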

Enjoy!

- Getting Started as a Data Scientist
  - **Getting Started with R and Rattle:** *Lecture - *Laboratory
  - **Introducing and Interacting with R:** *Lecture - *Laboratory
  - BasicR - OnePage(R) - Writing R scripts
- R for the Data Scientist
- Dealing With Data
  - **Reading Data into R:** *OnePageR - *R
  - **Exploring and Summarising Data:** *OnePageR - *R
  - **Visualising Data with GGPlot2:** *OnePageR - *R
  - **Transforming Data:** *OnePageR - *R
- Descriptive Analytics
- Predictive Analytics
  - **Decision Trees:** *Lecture - *OnePageR - *R - *Rattle
  - **Ensembles of Decision Trees:** *Lecture - *OnePageR - *R
  - SVM (R)
  - KernLab (R)
  - NeuralNetworks (R)
  - NNet (R)
  - **Evaluating Models:** *OnePageR - *R
  - Evaluation (R)
  - Scoring (R)
  - PMML (R) Exporting Models for Deployment
- Advanced Analytics
- Advanced R
  - **Strings:** OnePageR, R
  - **Dates and Time:** (PDF, R) Dates and Time
  - **Spatial Data** *OnePageR - *R
  - (R) Spatial Analysis
  - **Big Data** *OnePageR - *R
  - **Plots** (PDF, R) Miscellaneous Plots
  - **Functions** (PDF, R) Writing Functions in R
  - **Parallel Processing** (PDF, R) Parallel Execution
- Expert R

OnePageR is provided under a Creative Commons Attribution-ShareAlike 3.0 Unported License, allowing access to everyone for any purpose, and is provided at no cost. You can help cover the costs of providing this material through a $40 PayPal payment. In return we will provide you with a single PDF compilation of the material. Your support also encourages further development of this resource.

Other great resources for modular approaches to learning R include:

Other Togaware resources:

- CUNY NSF Workshop - March 2014
- AusDM-2013 Tutorial - November 2013
- IDEAL-2013 Tutorial - October 2013

Local package archive:

- `install.packages("rattle", repos="http://rattle.togaware.com", type="source")`
- `install.packages("wsrf", repos="http://rattle.togaware.com", type="source")`
- `install.packages("wsrpart", repos="http://rattle.togaware.com", type="source")`
- `install.packages("wskm", repos="http://rattle.togaware.com", type="source")`
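The four commands above can be folded into a single call that skips packages already present in the local library. This is only a sketch: the helper name `install_missing` is hypothetical, and it assumes the archive at `http://rattle.togaware.com` is still reachable and that the tools needed to build source packages are installed.

```r
# Hypothetical helper: install from the archive only those packages
# that are not already in the local library.
install_missing <- function(pkgs, repo = "http://rattle.togaware.com") {
  # Names of packages not yet installed locally.
  to_install <- setdiff(pkgs, rownames(installed.packages()))
  if (length(to_install) > 0) {
    # Same arguments as the individual commands above.
    install.packages(to_install, repos = repo, type = "source")
  }
  # Return the names that were requested for installation, invisibly.
  invisible(to_install)
}

# Example call (commented out because it needs network access):
# install_missing(c("rattle", "wsrf", "wsrpart", "wskm"))
```

Checking against `installed.packages()` first avoids rebuilding source packages that are already available.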

OnePageR by Graham Williams is Copyright © 2012 – 2014 Togaware Pty Ltd

Licensed under a Creative Commons Attribution-ShareAlike 3.0 Unported License.


*Last Modified 2014-04-06 19:31:38 gjw*