Internet Learning Volume 3, Number 2, Fall 2014 | Page 81

Internet Learning Figure 3. A Graph Database Schema for Threaded Discussion Data. V - Graph Database Schema and Technologies Because network thinking is fundamental to our approach, we will preface our data analysis with a conceptual overview of our graph database schema, and a technical summary of the graph technologies we used. We will reference this schema in our discussion of each research question. A. Conceptual Overview: Graph Database Schema We engaged with the applied graph science experts at the Aurelius consulting group, creators of the open-source TinkerPop graph computing stack, to model the conversational data as a network schema (a ‘directed property graph’), build a graph database against 80 that schema, and design a domain specific language (DSL) for traversing and interrogating the threaded discussion graph. We found several benefits to modeling the data as a graph, as shown in Figure 3. First, as a data structure, the graph allows us to pose many questions in an exploratory and intuitive manner. Second, the familiar concept map construct eased discussion and reasoning about the data among more- and less-technical researchers. This was particularly important given that we expected to discover new and important questions over the course of the study. Finally, the graph-structured data is easily exported in forms that can be used with existing network visualization tools. This allowed us to use visualization as a first-class investigative tool over the course of the study, as well as a post-hoc story-telling tool.