11.26 Box Plot in Rattle
20240728
ds %>%
dplyr::mutate(rain_tomorrow=as.factor(rain_tomorrow)) %>%
ggplot2::ggplot(ggplot2::aes(y=min_temp)) +
ggplot2::geom_boxplot(ggplot2::aes(x="All"), notch=TRUE, fill="grey") +
ggplot2::stat_summary(ggplot2::aes(x="All"), fun=mean, geom="point", shape=8) +
ggplot2::geom_boxplot(ggplot2::aes(x=rain_tomorrow, fill=rain_tomorrow), notch=TRUE) +
ggplot2::stat_summary(ggplot2::aes(x=rain_tomorrow), fun=mean, geom="point", shape=8) +
ggplot2::xlab(paste("rain_tomorrow\n\n", "RattleNG 2024-07-28 06:25:32 gjw", sep="")) +
ggplot2::ggtitle("Distribution of min_temp by rain_tomorrow") +
ggplot2::theme(legend.position="none") +
theme_rattle()
In Rattle, if we have identified a Target (or a Group By) variable,
then the box plot will show the distribution of the values of the
variable partitioned by the values of the target/group variable. For
the weather dataset we can, for example, plot the variable min_temp
grouped by rain_tomorrow
.
Your donation will support ongoing availability and give you access to the PDF version of this book. Desktop Survival Guides include Data Science, GNU/Linux, and MLHub. Books available on Amazon include Data Mining with Rattle and Essentials of Data Science. Popular open source software includes rattle, wajig, and mlhub. Hosted by Togaware, a pioneer of free and open source software since 1984. Copyright © 1995-2022 Graham.Williams@togaware.com Creative Commons Attribution-ShareAlike 4.0