20.19 Complexity Parameter

The function rpart::printcp() prints a table of the optimal prunings of the tree, one row per value of the complexity parameter (CP). The same table is stored in the model object as model$cptable.

printcp(model)
## 
## Classification tree:
## rpart(formula=form, data=ds[tr, vars], model=TRUE)
## 
## Variables actually used in tree construction:
## [1] humidity_3pm rainfall    
## 
## Root node error: 34195/158807=0.21532
## 
## n= 158807 
## 
##         CP nsplit rel error  xerror      xstd
## 1 0.153560      0   1.00000 1.00000 0.0047903
## 2 0.034917      1   0.84644 0.84612 0.0044984
## 3 0.010000      3   0.77660 0.77927 0.0043549
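
The rel error and xerror columns are scaled so that the root node has an error of 1, so multiplying them by the root node error recovers absolute error rates. The following is a minimal sketch of that arithmetic and of picking the row with the lowest cross-validated error. It assumes the table printed above; the matrix is typed in by hand as a stand-in for model$cptable so that the example runs on its own, and the names cptable, root_error, abs_cv_error, best_row and best_cp are illustrative only.

```r
# Stand-in for model$cptable, copied from the printcp() output above.
# With a fitted rpart model you would instead use: cptable <- model$cptable
cptable <- matrix(
  c(0.153560, 0, 1.00000, 1.00000, 0.0047903,
    0.034917, 1, 0.84644, 0.84612, 0.0044984,
    0.010000, 3, 0.77660, 0.77927, 0.0043549),
  ncol = 5, byrow = TRUE,
  dimnames = list(as.character(1:3),
                  c("CP", "nsplit", "rel error", "xerror", "xstd"))
)

# Root node error as reported: misclassified observations / all observations.
root_error <- 34195 / 158807

# Convert the relative errors into absolute error rates.
abs_train_error <- cptable[, "rel error"] * root_error
abs_cv_error    <- cptable[, "xerror"] * root_error

# Row with the lowest cross-validated error, and the CP value on that row.
best_row <- which.min(cptable[, "xerror"])
best_cp  <- cptable[best_row, "CP"]

print(round(abs_cv_error, 4))
print(best_cp)
```

For this table the lowest xerror is on the third row (CP of 0.01, 3 splits), which corresponds to a cross-validated error rate of roughly 16.8%. With a real model, a CP value chosen this way could be passed to rpart::prune(model, cp=best_cp) to obtain the pruned tree.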


Copyright © 1995-2022 Graham.Williams@togaware.com Creative Commons Attribution-ShareAlike 4.0