Clustering of scientific citations in Wikipedia
| Abstract | The instances of templates in Wikipedia form an interesting data set of structured information. Here I focus on the cite journal template, which is primarily used for citations to articles in scientific journals. These citations can be extracted and analyzed: non-negative matrix factorization is performed on an (article x journal) matrix, resulting in a soft clustering of Wikipedia articles and scientific journals, with each cluster more or less representing a scientific topic. |
| Keywords | Clustering, Wikipedia, non-negative matrix factorization, scientometrics |
| Type | Conference paper [With referee] |
| Conference | Wikimania 2008 |
| Year | 2008 |
| Month | June |
| Publisher | Informatics and Mathematical Modelling, Technical University of Denmark |
| Address | Richard Petersens Plads, Building 321, DK-2800 Kgs. Lyngby |
| Electronic version(s) | [pdf] |
| BibTeX data | [bibtex] |
| IMM Group(s) | Intelligent Signal Processing |
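
The factorization described in the abstract can be illustrated with a minimal sketch. The record does not state which NMF variant, cost function, or number of clusters the paper uses, so the code below assumes the standard multiplicative updates for the Frobenius-norm objective. The article names, journal names, and citation counts are hypothetical toy data, and the function `nmf` is an illustrative helper, not code from the paper.

```python
import numpy as np


def nmf(X, k, n_iter=1000, seed=0, eps=1e-9):
    """Factor a non-negative matrix X (articles x journals) as W @ H.

    Assumption: multiplicative updates minimizing the squared Frobenius
    error; the paper's actual NMF variant is not given in this record.
    """
    rng = np.random.default_rng(seed)
    n_articles, n_journals = X.shape
    # Strictly positive random initialization keeps the updates well defined.
    W = rng.random((n_articles, k)) + eps
    H = rng.random((k, n_journals)) + eps
    for _ in range(n_iter):
        # Each update multiplies by a non-negative ratio, so W and H stay >= 0.
        H *= (W.T @ X) / (W.T @ W @ H + eps)
        W *= (X @ H.T) / (W @ H @ H.T + eps)
    return W, H


# Hypothetical toy data: rows are Wikipedia articles, columns are journals,
# entries are counts of cite journal references from the article to the journal.
articles = ["Article A", "Article B", "Article C", "Article D"]
journals = ["Journal P", "Journal Q", "Journal R", "Journal S"]
X = np.array(
    [
        [4.0, 2.0, 0.0, 0.0],
        [6.0, 3.0, 0.0, 0.0],
        [0.0, 0.0, 3.0, 6.0],
        [0.0, 0.0, 1.0, 2.0],
    ]
)

W, H = nmf(X, k=2)

# Soft clustering: W[i, c] is the loading of article i on cluster c,
# H[c, j] is the loading of journal j on cluster c.
# A hard label, if wanted, is the cluster with the largest loading.
article_cluster = W.argmax(axis=1)
journal_cluster = H.argmax(axis=0)

for name, row in zip(articles, W):
    print(name, np.round(row, 3))
for name, col in zip(journals, H.T):
    print(name, np.round(col, 3))
```

Because the loadings in `W` and `H` are non-negative and real-valued, an article or journal can belong to several clusters to different degrees, which is what makes the clustering soft. In practice the (article x journal) matrix would be large and sparse, so a sparse-matrix implementation would likely be needed; that is outside the scope of this sketch.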