Association Analytics for Network Connectivity in a Bibliographic and Expertise Dataset

Authors: Boanerges Aleman-Meza, Sheron L. Decker, Delroy Cameron, I. Budak Arpinar

Abstract: Large-scale bibliography datasets are becoming increasingly available for use by Semantic Web applications. For example, DBLP is a high-quality bibliography of Computer Science literature. Its data is available in XML but it has also been made available in RDF as DR2Q-generated RDF data (Bizer, 2003), also in the SwetoDblp ontology of DBLP data (lsdis.cs.uga.edu/projects/semdis/swetodblp/), and Andreas Harth’s DBLP dataset in RDF (sw.deri.org/~aharth/2004/07/dblp/). Various studies have used DBLP data to analyze co-authorship, collaborations, degrees of separation and other social network analysis measures. We claim that further and more detailed analysis is possible by using semantically marked-up datasets. In this paper, we describe a study of network connectivity in bibliography data. Our work expands upon earlier studies that have used subset of DBLP data for analysis of collaborations in the field databases (Elmacioglu, 2005; Nascimento, 2003). The dataset we use includes not only data of publication in database field but also of research areas such as Artificial Intelligence, Web and Semantic Web.

Proceedings: Book Chapter (Semantic Web Engineering in the Knowledge Society)

Download

  • Paper
  •