10.50 Wind Directions
20180723 The three wind direction variables
(wind_gust_dir
, wind_dir_9am
, wind_dir_3pm
) are also identified as character. We review
the distribution of values here with dplyr::select()
identifying any variable that tidyselect::contains() the string
_dir
and then build a base::table() over those
variables.
# Review the distribution of observations across levels.
ds %>%
select(contains("_dir")) %>%
sapply(table)
## wind_gust_dir wind_dir_9am wind_dir_3pm
## N 13806 17553 13369
## NNE 10358 12645 10551
## NE 11402 11887 12981
## ENE 12884 12351 12356
## E 14449 14595 12999
## ESE 11947 12532 13414
## SE 14670 14647 16406
## SSE 13997 14323 14179
## S 14195 13436 15062
## SSW 13992 11958 12973
## SW 13731 13000 14086
## WSW 13948 10703 14718
## W 15263 12799 15451
## WNW 12707 11712 13744
## NW 12376 13110 13005
## NNW 10234 12058 11860
Observe all 16 compass directions are represented and it would make
sense to convert this into a factor. Notice that the directions are in
alphabetic order and conversion to factor will retain that. Instead
we can construct an ordered factor to capture the compass order (from
N
, NNE
, to NW
and NNW
). We note
the ordering of the directions here.
Your donation will support ongoing availability and give you access to the PDF version of this book. Desktop Survival Guides include Data Science, GNU/Linux, and MLHub. Books available on Amazon include Data Mining with Rattle and Essentials of Data Science. Popular open source software includes rattle, wajig, and mlhub. Hosted by Togaware, a pioneer of free and open source software since 1984. Copyright © 1995-2022 Graham.Williams@togaware.com Creative Commons Attribution-ShareAlike 4.0