Data Science Desktop Survival Guide
by Graham Williams |
|||||
Line Chart Skewed Distributions |
20200823
ds %>%
filter(rainfall != 0) %>% ggplot(aes(x=rainfall)) + geom_density() + scale_y_continuous(labels=comma) + theme(legend.position="none")
The plot informs us about the skewed nature of the amount of rainfall recorded for any one day, but we lose a lot of resolution at the low end. Note that we use a subset of the dataset to include only those observations of rainfall (i.e., where the rainfall is non-zero). Otherwise a warning will note many rows contain non-finite values in calculating the density statistic.
|