Skip to content. | Skip to navigation

Informatik 5
Information Systems
Prof. Dr. M. Jarke
Sections
Personal tools
You are here: Home Theses Comparing Communities and Topics in Wikipedias

Contact

Prof. Dr. M. Jarke
RWTH Aachen
Informatik 5
Ahornstr. 55
D-52056 Aachen
Tel +49/241/8021501
Fax +49/241/8022321

How to find us

Annual Reports

Disclaimer

Webmaster

 

 

Comparing Communities and Topics in Wikipedias

Thesis type
  • Bachelor
Status Open
Supervisor(s)
Advisor(s)

Investigate the relationship between communities and topics by applying overlapping community detection to the social network of contributors and subsets of intrawiki link networks on different Wikipedias.

Wikipedia is the largest crowd-sourced accumulation of human knowledge. Authors relate articles through intrawiki links, create category and list pages and cross reference sections. These links form a directed graph or network connecting content that is semantically related. Similarly contributors form a social network by editing the same pages. While the link network structure allows clustering pages into topics, the social network allows us to detect communities of contributors.

In this Bachelor Thesis we want to investigate the relationship between communities and topics. Therefore, we apply overlapping community detection to the social network of contributors and subsets of intrawiki link networks of different Wikipedias to analyze if communities match topic clusters.

As a technological infrastructure we use ArangoDB to store graph data and the ArangoDB Foxx microservice framework to deploy JavaScript microservices.

Students will be required to extract network information, store it into ArangoDBs graph database, and implement overlapping community detection algorithms for ArangoDBs pregel-based graph processor.

Prerequisites

Applicants should be proficient in JavaScript. Experience with server-side JavaScript and basic knowledge about graph theory is advantageous.

Document Actions