We might also have some specific transformations we would like to perform. The examples here may or may not be useful, depending on how we want to analyse the documents. This is really for illustration using the part of the document we are looking at here, rather than suggesting this specific transform adds value.
<- content_transformer(function(x, from, to) gsub(from, to, x)) toString <- tm_map(docs, toString, "harbin institute technology", "HIT") docs <- tm_map(docs, toString, "shenzhen institutes advanced technology", "SIAT") docs <- tm_map(docs, toString, "chinese academy sciences", "CAS")docs
Your donation will support ongoing availability and give you access to the PDF version of this book. Desktop Survival Guides include Data Science, GNU/Linux, and MLHub. Books available on Amazon include Data Mining with Rattle and Essentials of Data Science. Popular open source software includes rattle, wajig, and mlhub. Hosted by Togaware, a pioneer of free and open source software since 1984. Copyright © 1995-2022 Graham.Williams@togaware.com Creative Commons Attribution-ShareAlike 4.0