Go to TogaWare.com Home Page. Data Science Desktop Survival Guide
by Graham Williams
Duck Duck Go



CLICK HERE TO VISIT THE UPDATED SURVIVAL GUIDE

Graph Terminology

A graph is $G = (V, E)$ consisting of vertices/nodes and edges/links.

Graph links can be represented in a matrix with individual nodes labelling the rows and columns.

An attributed graph has features associated with each node which can be captured in a data frame.

Graphs may be used for social networks where the nodes represent users, the edges identify friendship or following, the node attributes identify the user features and the node class labels are user groups or tags. Publicly available datasets include flickr and blogcatalog.

Graphs may be used for citation networks where the nodes represent research papers, edges capture citation links, the node attributes are the keywords vectors generated from the research paper or its abstract, and the node class labels capture the area of research. Publicly available datasets include cora, citeseer, and pubmed.


Support further development by purchasing the PDF version of the book.
Other online resources include the GNU/Linux Desktop Survival Guide.
Books available on Amazon include Data Mining with Rattle and Essentials of Data Science.
Popular open source software includes rattle and wajig.
Hosted by Togaware, a pioneer of free and open source software since 1984.
Copyright © 2000-2020 Togaware Pty Ltd. . Creative Commons ShareAlike V4.