Question: (Implementation project) The DBLP data set (https://dblp.uni-trier.de/xml/) consists of over three million entries of research papers published in computer science conferences and journals. Among these entries, there are a good number of authors who have coauthor relationships.
a. Propose a method to efficiently mine a set of coauthor relationships that are closely correlated (e.g., often coauthoring papers together).
b. Based on the mining results and the pattern evaluation measures discussed in this chapter, discuss which measure may convincingly uncover close collaboration patterns better than others.
c. Based on the study in (a), develop a method that can roughly predict advisor and advisee relationships and the approximate period for such advisory supervision.
Step by Step Solution
a. The proposed method to efficiently mine a set of coauthor relationships that are closely correlated:
1. Data Collection: Download the DBLP dataset. The dataset is available in XML format and contains de...
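As a rough illustration of parts (a) and (b), the sketch below counts how often each pair of authors appears on the same paper and scores each frequent pair. It assumes the author lists have already been extracted from the DBLP XML into plain Python lists, and it assumes the chapter's evaluation measures include the Kulczynski measure and the imbalance ratio (IR). The function name `coauthor_measures`, the `min_support` threshold, and the toy author names are hypothetical and chosen only for this example.

```python
from collections import Counter
from itertools import combinations


def coauthor_measures(papers, min_support=2):
    """Score coauthor pairs; `papers` is an iterable of author-name lists.

    Hypothetical helper for illustration, not part of any DBLP tooling.
    """
    author_count = Counter()  # number of papers per author
    pair_count = Counter()    # number of papers per coauthor pair

    for authors in papers:
        unique = sorted(set(authors))  # drop duplicates, fix pair order
        author_count.update(unique)
        pair_count.update(combinations(unique, 2))

    scores = {}
    for (a, b), n_ab in pair_count.items():
        if n_ab < min_support:
            continue  # keep only pairs that coauthor often enough
        n_a, n_b = author_count[a], author_count[b]
        # Kulczynski: average of the two conditional probabilities.
        kulc = 0.5 * (n_ab / n_a + n_ab / n_b)
        # Imbalance ratio: how lopsided the two publication counts are.
        ir = abs(n_a - n_b) / (n_a + n_b - n_ab)
        scores[(a, b)] = {"support": n_ab, "kulc": kulc, "ir": ir}
    return scores


# Toy data standing in for author lists parsed from DBLP.
papers = [
    ["Ann", "Bob"],
    ["Ann", "Bob"],
    ["Ann", "Bob", "Cy"],
    ["Ann", "Dee"],
    ["Cy"],
]
results = coauthor_measures(papers)
for pair, measures in results.items():
    print(pair, measures)
```

Both measures are computed only from the papers that involve at least one of the two authors, so the millions of DBLP entries that involve neither author do not change the score. That null-invariance is one argument for preferring them over measures such as lift in part (b).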
