Clustering of scientific citations in Wikipedia 
Finn Årup Nielsen

Abstract  The instances of templates in Wikipedia form an interesting data set of structured information. Here I focus on the cite journal template, which is primarily used for citations to articles in scientific journals. These citations can be extracted and analyzed: nonnegative matrix factorization is performed on an (article × journal) matrix, resulting in a soft clustering of Wikipedia articles and scientific journals, with each cluster more or less representing a scientific topic.
Keywords  Clustering, Wikipedia, nonnegative matrix factorization, scientometrics 
Type  Conference paper [With referee] 
Conference  Wikimania 2008 
Year  2008 Month June 
Publisher  Informatics and Mathematical Modelling, Technical University of Denmark 
Address  Richard Petersens Plads, Building 321, DK-2800 Kgs. Lyngby
Electronic version(s)  [pdf] 
BibTeX data  [bibtex] 
IMM Group(s)  Intelligent Signal Processing 
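
The factorization named in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes the standard multiplicative-update rule for the squared-error objective, and the article names, journal names, and citation counts are made-up toy data. The function name `nmf` and its parameters are likewise hypothetical. Each row of W gives an article's loading on every cluster and each column of H gives a journal's loading, which is what makes the clustering soft.

```python
import random

def matmul(A, B):
    """Plain-list matrix product A @ B."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

def transpose(A):
    return [list(col) for col in zip(*A)]

def nmf(X, k, iterations=500, seed=0, eps=1e-9):
    """Approximate nonnegative X (n x m) as W (n x k) times H (k x m).

    Uses multiplicative updates for the squared-error objective
    (an assumption; the abstract does not say which variant was used).
    """
    rng = random.Random(seed)
    n, m = len(X), len(X[0])
    # Strictly positive random start so every entry can be updated.
    W = [[rng.random() + 0.1 for _ in range(k)] for _ in range(n)]
    H = [[rng.random() + 0.1 for _ in range(m)] for _ in range(k)]
    for _ in range(iterations):
        # H <- H * (W^T X) / (W^T W H), elementwise
        Wt = transpose(W)
        num = matmul(Wt, X)
        den = matmul(matmul(Wt, W), H)
        H = [[H[i][j] * num[i][j] / (den[i][j] + eps) for j in range(m)]
             for i in range(k)]
        # W <- W * (X H^T) / (W H H^T), elementwise
        Ht = transpose(H)
        num = matmul(X, Ht)
        den = matmul(W, matmul(H, Ht))
        W = [[W[i][j] * num[i][j] / (den[i][j] + eps) for j in range(k)]
             for i in range(n)]
    return W, H

# Toy (article x journal) citation counts; all names and numbers are invented.
articles = ["article_a", "article_b", "article_c", "article_d"]
journals = ["journal_1", "journal_2", "journal_3", "journal_4"]
X = [
    [4, 2, 0, 0],
    [2, 1, 0, 0],
    [0, 0, 3, 6],
    [0, 0, 1, 2],
]

W, H = nmf(X, k=2)

# A hard label per item is the cluster with the largest loading;
# the full rows of W and columns of H are the soft assignments.
article_cluster = [max(range(2), key=lambda c: row[c]) for row in W]
journal_cluster = [max(range(2), key=lambda c: H[c][j]) for j in range(len(journals))]

for name, label in zip(articles, article_cluster):
    print(name, "-> cluster", label)
for name, label in zip(journals, journal_cluster):
    print(name, "-> cluster", label)
```

In this toy matrix the first two articles cite only the first two journals and the last two articles cite only the last two, so the two clusters separate cleanly. On real citation data the loadings overlap and the number of clusters k has to be chosen by the analyst.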