Go to TogaWare.com Home Page. Data Science Desktop Survival Guide
by Graham Williams
Duck Duck Go



CLICK HERE TO VISIT THE UPDATED SURVIVAL GUIDE

Line Chart Skewed Distributions

20200823

\includegraphics[width=\textwidth]{figures/onepager/ggplot2:frequency_rainfall-1}

ds %>%
  filter(rainfall != 0) %>%
  ggplot(aes(x=rainfall)) +
  geom_density() +
  scale_y_continuous(labels=comma) +
  theme(legend.position="none")

The plot informs us about the skewed nature of the amount of rainfall recorded for any one day, but we lose a lot of resolution at the low end.

Note that we use a subset of the dataset to include only those observations of rainfall (i.e., where the rainfall is non-zero). Otherwise a warning will note many rows contain non-finite values in calculating the density statistic.


Support further development by purchasing the PDF version of the book.
Other online resources include the GNU/Linux Desktop Survival Guide.
Books available on Amazon include Data Mining with Rattle and Essentials of Data Science.
Popular open source software includes rattle and wajig.
Hosted by Togaware, a pioneer of free and open source software since 1984.
Copyright © 2000-2020 Togaware Pty Ltd. . Creative Commons ShareAlike V4.