21.69 The Original C5.0 Implementation

The C50 package (Kuhn and Quinlan 2021) interfaces to the original C code of the C5.0 implementation by Ross Quinlan, the developer of the decision tree induction algorithm.

library(C50)
model <- C5.0(form, ds[tr, vars])
model
## 
## Call:
## C5.0.formula(formula = form, data = ds[tr, vars])
## 
## Classification Tree
## Number of samples: 123722 
## Number of predictors: 20 
## 
## Tree size: 808 
## 
## Non-standard options: attempt to group attributes

C5imp(model)
##                 Overall
## humidity_3pm     100.00
## wind_gust_speed   93.57
## rain_today        84.49
## cloud_3pm         81.41
## sunshine          41.09
## pressure_3pm      32.21
## rainfall          24.50
## pressure_9am      22.96
## wind_dir_3pm      17.48
## min_temp          13.16
## temp_3pm          10.08
## max_temp          10.08
## wind_gust_dir      8.76
## temp_9am           8.43
## wind_speed_3pm     8.14
## wind_dir_9am       7.94
## cloud_9am          7.52
## wind_speed_9am     7.25
## humidity_9am       7.19
## evaporation        4.13
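The importance scores above come from C5imp() with its default metric, "usage", which reports the percentage of training cases that pass through a split on each variable. The following is a minimal sketch comparing this with the "splits" metric. Because ds, form, tr and vars are defined elsewhere in the book, the sketch assumes the built-in iris dataset as a stand-in, and the names form_demo, model_demo, imp_usage and imp_splits are illustrative only.

```r
library(C50)

# Stand-in data (an assumption): iris replaces the book's ds[tr, vars].
data(iris)
form_demo <- Species ~ .

# Fit a single C5.0 classification tree.
model_demo <- C5.0(form_demo, data = iris)

# Default metric: percentage of training cases covered by splits
# on each predictor.
imp_usage <- C5imp(model_demo, metric = "usage")

# Alternative metric: percentage of all splits that use each predictor.
imp_splits <- C5imp(model_demo, metric = "splits")

print(imp_usage)
print(imp_splits)
```

The two metrics can rank variables differently: a variable used once near the root covers many cases but accounts for few splits.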

Plotting a tree of this size appears to take a long time, so the following command is not evaluated here.

plot(model)

I am not aware of any converter from a C5.0 tree to an rpart tree, so fancyRpartPlot() will not be useful here.
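When a graphical plot is impractical, the model can still be inspected as text. The sketch below is one possible approach rather than the method used in this book: summary() returns the textual tree produced by the C code, and fitting with rules = TRUE yields a rule-based model that is often easier to read. It again assumes the built-in iris dataset as a stand-in for ds[tr, vars], and the names tree_demo, tree_text and rules_demo are illustrative only.

```r
library(C50)

# Stand-in data (an assumption): iris replaces the book's ds[tr, vars].
data(iris)

# Fit a tree and capture its textual description.
tree_demo <- C5.0(Species ~ ., data = iris)
tree_text <- summary(tree_demo)

# Printing the summary shows the tree structure and the
# training-set evaluation reported by C5.0.
print(tree_text)

# Fit the same formula as a rule-based model instead of a tree.
rules_demo <- C5.0(Species ~ ., data = iris, rules = TRUE)
print(summary(rules_demo))
```

For a tree with hundreds of nodes the text output is long, so it may be worth redirecting it to a file for browsing.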

References

Kuhn, Max, and Ross Quinlan. 2021. C50: C5.0 Decision Trees and Rule-Based Models. https://topepo.github.io/C5.0/.


Copyright © 1995-2021 Graham.Williams@togaware.com Creative Commons Attribution-ShareAlike 4.0.