Go to TogaWare.com Home Page. Data Science Desktop Survival Guide
by Graham Williams
Duck Duck Go



CLICK HERE TO VISIT THE UPDATED SURVIVAL GUIDE

Skewed Distributions

Raw The plot of Figure 2.3 certainly informs us about the skewed nature of the amount of rainfall recorded for any one day, but we lose a lot of resolution at the low end.

Note that we use a subset of the dataset to include only those observations of rainfall (i.e., where the rainfall is non-zero). Otherwise a warning will note many rows contain non-finite values in calculating the density statistic.

ds %>%
  filter(rainfall != 0) %>%
  ggplot(aes(x=rainfall)) +
  geom_density() +
  scale_y_continuous(labels=comma) +
  theme(legend.position="none")

Figure 2.3: Skewed density distribution.

\includegraphics[width=\textwidth]{figures/onepager/ggplot2:frequency_rainfall-1}


Support further development by purchasing the PDF version of the book.
Other online resources include the GNU/Linux Desktop Survival Guide.
Books available on Amazon include Data Mining with Rattle and Essentials of Data Science.
Popular open source software includes rattle and wajig.
Hosted by Togaware, a pioneer of free and open source software since 1984.
Copyright © 2000-2020 Togaware Pty Ltd. . Creative Commons ShareAlike V4.