21.69 The Original C5.0 Implementation

The (Kuhn and Quinlan 2020) package interfaces the original C code of the C5.0 implementation by Ross Quinlan, the developer of the decision tree induction algorithm.

library(C50)
model <- C5.0(form, ds[tr, vars])
model
## 
## Call:
## C5.0.formula(formula = form, data = ds[tr, vars])
## 
## Classification Tree
## Number of samples: 134001 
## Number of predictors: 20 
## 
## Tree size: 920 
## 
## Non-standard options: attempt to group attributes
C5imp(model)
##                 Overall
## humidity_3pm     100.00
## wind_gust_speed   92.96
## sunshine          86.01
## rain_today        81.72
## pressure_3pm      50.17
## rainfall          25.38
## wind_dir_3pm      20.79
## cloud_3pm         17.66
## min_temp          15.36
## max_temp          15.18
## temp_9am          14.26
## pressure_9am      13.57
## humidity_9am      10.68
## temp_3pm           8.22
## cloud_9am          7.59
## wind_speed_3pm     5.86
## wind_speed_9am     5.50
## wind_dir_9am       5.30
## wind_gust_dir      5.16
## evaporation        5.16

% DONT EVAL YET - SEEMS TO BE TAKING LONG TIME

plot(model)

I am not aware of any converter from a C5.0 tree to an rpart tree and so fancyRpartPlot() will not be useful here.



Your donation will support ongoing development and give you access to the PDF version of the book. Desktop Survival Guides include Data Science, GNU/Linux, and MLHub. Books available on Amazon include Data Mining with Rattle and Essentials of Data Science. Popular open source software includes rattle, wajig, and mlhub. Hosted by Togaware, a pioneer of free and open source software since 1984.
Copyright © 1995-2021 Graham.Williams@togaware.com Creative Commons Attribution-ShareAlike 4.0.