10.48 Location
20180723 From our review of the data so far, we can start to
make some observations about the character variables. The first is
location. Several locations were reported in
the above exploration of the dataset. We can confirm how many
there are by counting the base::unique() values
the variable has in the original dataset.
## [1] 49
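As a minimal sketch of how such a count might be obtained, the following uses a small made-up data frame in place of the actual dataset. The names ds and location are assumptions for illustration only.

```r
# Hypothetical stand-in for the weather dataset: a handful of rows only.
ds <- data.frame(
  location = c("Canberra", "Sydney", "Canberra", "Albury", "Sydney"),
  stringsAsFactors = FALSE
)

# Count the distinct values of the character variable.
n_locations <- length(unique(ds$location))
n_locations
```

On the full dataset the same expression would be expected to report the 49 locations noted above.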
In general we cannot know what other locations we will come across in related datasets, and with 49 locations we already have quite a collection. We will therefore retain this variable as a character data type.
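One way to read this choice is to compare a character vector with a factor when a previously unseen location turns up. The sketch below is illustrative only. The vectors loc_chr and loc_fct are hypothetical, and the comparison with a factor is our assumption about the alternative being avoided.

```r
# Two hypothetical location vectors holding the same values.
loc_chr <- c("Canberra", "Sydney")
loc_fct <- factor(loc_chr)

# A character vector accepts a new location without complaint.
loc_chr[3] <- "Hobart"

# A factor only knows its existing levels, so an unseen location
# becomes NA (R also issues a warning, suppressed here).
suppressWarnings(loc_fct[3] <- "Hobart")

loc_chr
is.na(loc_fct[3])
```

Keeping the variable as character avoids having to revise the set of levels each time a related dataset introduces a new location.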
Here is a list of locations and their frequencies in the dataset.
## .
##         Adelaide           Albany           Albury     AliceSprings
##             3924             4739             4740             4740
##    BadgerysCreek         Ballarat          Bendigo         Brisbane
##             4690             4738             4731             4893
##           Cairns         Canberra            Cobar     CoffsHarbour
##             4740             5136             4709             4709
##         Dartmoor           Darwin        GoldCoast           Hobart
##             4709             4893             4740             4893
##        Katherine       Launceston        Melbourne MelbourneAirport
##             3278             4740             4893             4709
##          Mildura            Moree     MountGambier      MountGinini
##             4709             4709             4739             4728
##        Newcastle             Nhil        NorahHead    NorfolkIsland
##             4030             3278             4704             4709
##        Nuriootpa       PearceRAAF          Penrith            Perth
##             4708             4708             4739             4892
##     PerthAirport         Portland         Richmond             Sale
##             4708             4709             4709             4709
##       SalmonGums           Sydney    SydneyAirport       Townsville
##             4662             5044             4709             4740
##      Tuggeranong            Uluru       WaggaWagga          Walpole
##             4739             3278             4709             4705
##         Watsonia      Williamtown      Witchcliffe       Wollongong
....
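A frequency listing of this kind could be produced along the following lines. This is a sketch on a small made-up data frame, with ds and location again assumed names, and is not necessarily the code used to generate the output above.

```r
# Hypothetical stand-in for the weather dataset: a handful of rows only.
ds <- data.frame(
  location = c("Canberra", "Sydney", "Canberra", "Albury", "Sydney", "Canberra"),
  stringsAsFactors = FALSE
)

# Tabulate how often each location occurs; names are sorted alphabetically.
freq <- table(ds$location)
freq

# Optionally order the locations from most to least frequent.
sort(freq, decreasing = TRUE)
```

Sorting by frequency can make it easier to spot locations with noticeably fewer observations than the rest.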
Copyright © 1995-2022 Graham.Williams@togaware.com Creative Commons Attribution-ShareAlike 4.0