Data Science Desktop Survival Guide
by Graham Williams

Extract Column as Vector

20200813 To extract a single column from a dataset and return it as a vector, rather than as a single-column dataset, we can use dplyr::pull().

In the example below all of the values of a particular column are pulled and we retain just the base::unique() values from the resulting vector.

Using a tee pipe (magrittr's %T>%), the expression between the curly brackets prints the first 20 locations as a side effect, whilst the pipeline continues with the unchanged vector, which is then assigned to a variable for later use.

library(dplyr)     # Provides pull() and the %>% pipe.
library(magrittr)  # Provides the tee pipe %T>%.

ds %>%
  pull(location) %>%
  unique() %T>%
  {head(., 20) %>% print()} ->
locations
##  [1] "Albury"        "BadgerysCreek" "Cobar"         "CoffsHarbour" 
##  [5] "Moree"         "Newcastle"     "NorahHead"     "NorfolkIsland"
##  [9] "Penrith"       "Richmond"      "Sydney"        "SydneyAirport"
## [13] "WaggaWagga"    "Williamtown"   "Wollongong"    "Canberra"     
## [17] "Tuggeranong"   "MountGinini"   "Ballarat"      "Bendigo"
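
To make the difference between a dataset column and a vector concrete, here is a minimal sketch on a small made-up data frame. The names toy and locs and the values in toy are assumptions for illustration only and are not part of the dataset used above.

```r
library(dplyr)     # Provides select(), pull() and the %>% pipe.
library(magrittr)  # Provides the tee pipe %T>%.

# Hypothetical toy data standing in for ds (illustrative values only).
toy <- data.frame(location = c("Albury", "Cobar", "Albury", "Sydney"),
                  rainfall = c(0.6, 0.0, 1.2, 3.4),
                  stringsAsFactors = FALSE)

# select() keeps the column wrapped in a data frame.
toy %>% select(location) %>% class() %>% print()

# pull() returns the underlying character vector.
toy %>% pull(location) %>% class() %>% print()

# The tee pipe prints the first two unique values as a side effect
# and passes the full unique vector on to be assigned to locs.
toy %>%
  pull(location) %>%
  unique() %T>%
  {head(., 2) %>% print()} ->
locs

# locs holds every unique value, not just the two that were printed.
length(locs) %>% print()
```

Because %T>% returns its left-hand side rather than the result of the bracketed expression, locs receives the complete vector of unique locations even though only the first two are printed.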


Hosted by Togaware, a pioneer of free and open source software since 1984.
Copyright © 2000-2020 Togaware Pty Ltd. Creative Commons ShareAlike V4.