Data Science Desktop Survival Guide
by
Graham Williams
Desktop Survival
Project Home
Preface
Data Science
Introducing R
R Constructs
R Tasks
R Strings
R Read, Write, and Create
Data Template
Data Exploration
Data Wrangling
Data Visualisation
Statistics
ML Template
ML Scenarios
ML Activities
ML Applications
ML Algorithms
Cluster Analysis
Decision Trees
Computer Vision
Graph Data
Privacy
Literate Data Science
Coding with Style
Resources
Bibliography
Index
CLICK HERE TO VISIT THE UPDATED SURVIVAL GUIDE
Bibliography
Bibliography
Aragon TJ (2020).
epitools: Epidemiology Tools
.
R package version 0.5-10.1, URL
https://CRAN.R-project.org/package=epitools
.
Bache SM, Wickham H (2020).
magrittr: A Forward-Pipe Operator for R
.
R package version 2.0.1, URL
https://CRAN.R-project.org/package=magrittr
.
Breiman L, Cutler A, Liaw A, Wiener M (2018).
randomForest: Breiman and Cutler's Random Forests for Classification and Regression
.
R package version 4.6-14, URL
https://www.stat.berkeley.edu/~breiman/RandomForests/
.
Dahl DB, Scott D, Roosen C, Magnusson A, Swinton J (2019).
xtable: Export Tables to LaTeX or HTML
.
R package version 1.8-4, URL
http://xtable.r-forge.r-project.org/
.
Dragulescu A, Arendt C (2020).
xlsx: Read, Write, Format Excel 2007 and Excel 97/2000/XP/2003 Files
.
R package version 0.6.5, URL
https://github.com/colearendt/xlsx
.
Durant W (1926).
The Story of Philosophy
.
2012 edition. Simon and Schuster.
Firke S (2021).
janitor: Simple Tools for Examining and Cleaning Dirty Data
.
R package version 2.1.0, URL
https://github.com/sfirke/janitor
.
Gagolewski M, Tartanus B, , other contributors; IBM, Unicode, Inc, other contributors; Unicode, Inc (2020).
stringi: Character String Processing Facilities
.
R package version 1.5.3, URL
https://CRAN.R-project.org/package=stringi
.
Gandrud C (2014).
Repreoducible Research with R and RStdio
.
the R Series. CRC Press.
Harrell Jr FE (2020).
Hmisc: Harrell Miscellaneous
.
R package version 4.4-2, URL
https://CRAN.R-project.org/package=Hmisc
.
Hester J (2020).
glue: Interpreted String Literals
.
R package version 1.4.2, URL
https://CRAN.R-project.org/package=glue
.
Hornik K (2020).
RWeka: R/Weka Interface
.
R package version 0.4-42, URL
https://CRAN.R-project.org/package=RWeka
.
Hothorn T, Hornik K, Strobl C, Zeileis A (2020).
party: A Laboratory for Recursive Partytioning
.
R package version 1.3-5, URL
http://party.R-forge.R-project.org
.
Hothorn T, Zeileis A (2020).
partykit: A Toolkit for Recursive Partytioning
.
R package version 1.2-11, URL
http://partykit.r-forge.r-project.org/partykit/
.
Kaiser S, Santamaria R, Khamiakova T, Sill M, Theron R, Quintales L, Leisch F, De Troyer E (2020).
biclust: BiCluster Algorithms
.
R package version 2.0.2, URL
https://CRAN.R-project.org/package=biclust
.
Keitt T (2012).
colorRamps: Builds color tables
.
R package version 2.3, URL
https://CRAN.R-project.org/package=colorRamps
.
Knuth DE (1984).
“Literate Programming.”
The Computer Journal (British Computer Society)
,
27
(2), 97–111.
URL
http://www.literateprogramming.com/knuthweb.pdf
.
Kuhn M, Quinlan R (2020).
C50: C5.0 Decision Trees and Rule-Based Models
.
R package version 0.1.3.1, URL
https://topepo.github.io/C5.0
.
Liu Z, Xia X, Treude C, Lo D, Li S (2019).
“Automatic Generation of Pull Request Descriptions.”
2019 34th IEEE/ACM International Conference on Automated Software Engineering (ASE)
.
doi: rm10.1109/ase.2019.00026.
URL
http://dx.doi.org/10.1109/ASE.2019.00026
.
Milborrow S (2020).
rpart.plot: Plot rpart Models: An Enhanced Version of plot.rpart
.
R package version 3.0.9, URL
http://www.milbo.org/rpart-plot/index.html
.
Neuwirth E (2014).
RColorBrewer: ColorBrewer Palettes
.
R package version 1.1-2, URL
https://CRAN.R-project.org/package=RColorBrewer
.
Ooms J (2020).
writexl: Export Data Frames to Excel xlsx Format
.
R package version 1.3.1, URL
https://CRAN.R-project.org/package=writexl
.
Qin D, Zhou X, Chen L, Huang G, Zhang Y (2020).
“Dynamic Connection-Based Social Group Recommendation.”
IEEE Transactions on Knowledge and Data Engineering
,
32
(3), 453–467.
R Core Team (2018).
R: A Language and Environment for Statistical Computing
.
R Foundation for Statistical Computing, Vienna, Austria.
URL
https://www.R-project.org/
.
R Core Team (2020).
R: A Language and Environment for Statistical Computing
.
R Foundation for Statistical Computing, Vienna, Austria.
URL
https://www.R-project.org/
.
Rinker T (2018).
wakefield: Generate Random Data Sets
.
R package version 0.3.3, URL
https://github.com/trinker/wakefield
.
Ripley B (2020).
nnet: Feed-Forward Neural Networks and Multinomial Log-Linear Models
.
R package version 7.3-14, URL
http://www.stats.ox.ac.uk/pub/MASS4/
.
Romanski P, Kotthoff L (2018).
FSelector: Selecting Attributes
.
R package version 0.31, URL
https://CRAN.R-project.org/package=FSelector
.
Sarkar D (2020).
lattice: Trellis Graphics for R
.
R package version 0.20-41, URL
http://lattice.r-forge.r-project.org/
.
Schloerke B, Cook D, Larmarange J, Briatte F, Marbach M, Thoen E, Elberg A, Crowley J (2021).
GGally: Extension to ggplot2
.
R package version 2.1.0, URL
https://CRAN.R-project.org/package=GGally
.
Sing T, Sander O, Beerenwinkel N, Lengauer T (2020).
ROCR: Visualizing the Performance of Scoring Classifiers
.
R package version 1.0-11, URL
http://ipa-tys.github.io/ROCR/
.
Soetaert K (2020).
diagram: Functions for Visualising Simple Graphs (Networks), Plotting Flow Diagrams
.
R package version 1.6.5, URL
https://CRAN.R-project.org/package=diagram
.
Spinu V, Grolemund G, Wickham H (2020).
lubridate: Make Dealing with Dates a Little Easier
.
R package version 1.7.9.2, URL
https://CRAN.R-project.org/package=lubridate
.
Therneau T, Atkinson B (2019).
rpart: Recursive Partitioning and Regression Trees
.
R package version 4.1-15, URL
https://CRAN.R-project.org/package=rpart
.
Velten K (2009).
Mathematical Modeling and Simulation
.
Wiley.
ISBN 978-3-527-40758-3.
Wickham H (2014).
Advanced R
.
Chapman & Hall/CRC The R Series. Chapman & Hall.
Wickham H (2019a).
lobstr: Visualize R Data Structures with Trees
.
R package version 1.1.1, URL
https://github.com/r-lib/lobstr
.
Wickham H (2019b).
stringr: Simple, Consistent Wrappers for Common String Operations
.
R package version 1.4.0, URL
https://CRAN.R-project.org/package=stringr
.
Wickham H (2020).
tidyr: Tidy Messy Data
.
R package version 1.1.2, URL
https://CRAN.R-project.org/package=tidyr
.
Wickham H, Bryan J (2019).
readxl: Read Excel Files
.
R package version 1.3.1, URL
https://CRAN.R-project.org/package=readxl
.
Wickham H, Chang W, Henry L, Pedersen TL, Takahashi K, Wilke C, Woo K, Yutani H, Dunnington D (2020).
ggplot2: Create Elegant Data Visualisations Using the Grammar of Graphics
.
R package version 3.3.3, URL
https://CRAN.R-project.org/package=ggplot2
.
Wickham H, François R, Henry L, Müller K (2021).
dplyr: A Grammar of Data Manipulation
.
R package version 1.0.3, URL
https://CRAN.R-project.org/package=dplyr
.
Wickham H, Hester J (2020).
readr: Read Rectangular Text Data
.
R package version 1.4.0, URL
https://CRAN.R-project.org/package=readr
.
Wickham H, Seidel D (2020).
scales: Scale Functions for Visualization
.
R package version 1.1.1, URL
https://CRAN.R-project.org/package=scales
.
Williams GJ (1989).
“FrameUp: A frames formalism for expert systems.”
Australian Computer Journal
,
21
(1), 33–40.
URL
http://togaware.com/papers/acj89_heffe.pdf
.
Williams GJ (2011).
Data Mining with Rattle and R: The art of excavating data for knowledge discovery.
Use R! Springer, New York.
Williams GJ (2017).
The Essentials of Data Science: Knowledge discovery using R
.
The R Series. CRC Press.
Williams GJ (2020).
rattle: Graphical User Interface for Data Science in R
.
R package version 5.4.7, URL
https://rattle.togaware.com/
.
Xie Y (2014).
Dynamic Documents with R and knitr
.
The R Series. CRC Press.
Xie Y (2020).
knitr: A General-Purpose Package for Dynamic Report Generation in R
.
R package version 1.30, URL
https://yihui.org/knitr/
.
Support further development by purchasing the
PDF version of the book
.
Other online resources include the
GNU/Linux Desktop Survival Guide
.
Books available on Amazon include
Data Mining with Rattle
and
Essentials of Data Science
.
Popular open source software includes
rattle
and
wajig
.
Hosted by
Togaware
, a pioneer of free and open source software since 1984.
Copyright © 2000-2020 Togaware Pty Ltd.
. Creative Commons ShareAlike V4
.