Bachelor Project - Software Technology | Project No. 0014: Textual Similarity
This project is concerned with comparing two texts in order to determine how closely they discuss the same topic(s). This is useful, for example, when analysing Web pages or comparing different specifications of a piece of software, and in many other contexts. Several algorithms have been proposed in the literature, and the project involves implementing at least two of them in order to compare their performance. The comparison covers not only their complexity (i.e. the effort required to compare the given texts), but also the accuracy they achieve in detecting similar texts.
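The description does not name the algorithms to be implemented. Purely as an illustration of what comparing two similarity measures could look like, the sketch below implements two common word-based measures, Jaccard similarity on vocabularies and cosine similarity on word counts. The choice of these two measures, the simple tokenizer, and all function names are assumptions made for this example and are not part of the project specification.

```python
import math
import re
from collections import Counter


def tokenize(text: str) -> list[str]:
    """Lowercase the text and split it into alphanumeric word tokens (illustrative tokenizer)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def jaccard_similarity(a: str, b: str) -> float:
    """Shared vocabulary size divided by combined vocabulary size."""
    set_a, set_b = set(tokenize(a)), set(tokenize(b))
    if not set_a and not set_b:
        return 1.0  # convention chosen here: two empty texts count as identical
    return len(set_a & set_b) / len(set_a | set_b)


def cosine_similarity(a: str, b: str) -> float:
    """Cosine of the angle between the word-count vectors of the two texts."""
    counts_a, counts_b = Counter(tokenize(a)), Counter(tokenize(b))
    # Only words present in both texts contribute to the dot product.
    dot = sum(counts_a[w] * counts_b[w] for w in counts_a.keys() & counts_b.keys())
    norm_a = math.sqrt(sum(c * c for c in counts_a.values()))
    norm_b = math.sqrt(sum(c * c for c in counts_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty text has no direction, so report no similarity
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    text_1 = "the cat sat"
    text_2 = "the cat ran"
    print(jaccard_similarity(text_1, text_2))
    print(cosine_similarity(text_1, text_2))
```

Both functions run in time roughly linear in the length of the two texts, so with measures like these the more interesting part of the comparison is likely to be accuracy: how well each score separates texts on the same topic from unrelated ones on a labelled test set.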
Supervisor(s): Robin Sharp
Last updated: Oct 31, 2011 by Hans Henrik Løvengreen