Clustering of scientific citations in Wikipedia (slides)



Abstract: The instances of templates in Wikipedia form an interesting data set of structured information. Here I focus on the cite journal template, which is primarily used to cite articles in scientific journals. These citations can be extracted and analyzed: non-negative matrix factorization is performed on an (article x journal) matrix, resulting in a soft clustering of Wikipedia articles and scientific journals, with each cluster more or less representing a scientific topic.
Keywords: Wikipedia, non-negative matrix factorization, clustering, scientometrics, Wikimania, citation
Type: Misc [Presentation]
Journal/Book/Conference: Wikimania 2008
Year: 2008, Month: July
Publisher: Informatics and Mathematical Modelling, Technical University of Denmark
Address: Richard Petersens Plads, Building 321, DK-2800 Kgs. Lyngby
Electronic version(s): [pdf]
BibTeX data: [bibtex]
IMM Group(s): Intelligent Signal Processing
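
The factorization described in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the article names, journal names and citation counts are invented toy data, and the multiplicative update rule (Lee and Seung) is one common way to compute a non-negative matrix factorization, not necessarily the algorithm used in the slides. The matrix `counts` plays the role of the (article x journal) matrix; the rows of `w` give each article's soft membership in each cluster, and the columns of `h` do the same for journals.

```python
import random

# Hypothetical toy data: rows are Wikipedia articles, columns are journals,
# entries are numbers of cite journal citations (invented for illustration).
articles = ["Neuron", "Synapse", "Black hole", "Quasar"]
journals = ["J. Neurosci.", "Brain Res.", "Astrophys. J.", "MNRAS"]
counts = [
    [5, 3, 0, 0],
    [4, 2, 0, 0],
    [0, 0, 6, 2],
    [0, 0, 3, 1],
]


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(m):
    """Return the transpose of a matrix stored as a list of rows."""
    return [list(col) for col in zip(*m)]


def nmf(v, k, iterations=1000, seed=0, eps=1e-9):
    """Approximate non-negative v (n x m) as w (n x k) times h (k x m).

    Uses multiplicative updates for the squared-error objective; this
    choice of algorithm is an assumption of the sketch.
    """
    rng = random.Random(seed)
    n, m = len(v), len(v[0])
    w = [[rng.random() + 0.1 for _ in range(k)] for _ in range(n)]
    h = [[rng.random() + 0.1 for _ in range(m)] for _ in range(k)]
    for _ in range(iterations):
        # Update h with w held fixed: h <- h * (w^T v) / (w^T w h)
        wt = transpose(w)
        num = matmul(wt, v)
        den = matmul(matmul(wt, w), h)
        h = [[h[i][j] * num[i][j] / (den[i][j] + eps) for j in range(m)]
             for i in range(k)]
        # Update w with the new h held fixed: w <- w * (v h^T) / (w h h^T)
        ht = transpose(h)
        num = matmul(v, ht)
        den = matmul(w, matmul(h, ht))
        w = [[w[i][j] * num[i][j] / (den[i][j] + eps) for j in range(k)]
             for i in range(n)]
    return w, h


w, h = nmf(counts, k=2)

# Hard labels are only a summary; the weights in w and h are the soft clustering.
article_clusters = [max(range(2), key=lambda c: row[c]) for row in w]
journal_clusters = [max(range(2), key=lambda c: h[c][j]) for j in range(len(journals))]

for name, row in zip(articles, w):
    print(name, [round(x, 2) for x in row])
```

Because the factors are constrained to be non-negative, each cluster can only add citations, never cancel them, which is what makes the components readable as topics. On real data the number of clusters `k` has to be chosen by the analyst.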