Speaker Recognition | Ling Feng
| Abstract | The work leading to this thesis has been focused on establishing a text-independent closed-set speaker recognition system. Contrary to other recognition systems, this system was built with two parts for the purpose of improving the recognition accuracy. The first part is the speaker pruning performed by KNN algorithm. To decrease the gender misclassification in KNN, a novel technique was used, where Pitch and MFCC features were combined. This technique, in fact, does not only improve the gender misclassification, but also leads to an increase on the total performance of the pruning. The second part is the DDHMM speaker recognition performed on the survived speakers after pruning. By adding the speaker pruning part, the system recognition accuracy was increased 9.3%.
During the project period, an English Language Speech Database for Speaker Recognition (ELSDSR) was built. The system was trained and tested with both TIMIT and ELSDSR database. | Keywords | feature extraction, MFCC, KNN, speaker pruning, DDHMM, speaker recognition and ELSDSR | Type | Master's thesis [Academic thesis] | Year | 2004 | Publisher | Informatics and Mathematical Modelling, Technical University of Denmark, DTU | Address | Richard Petersens Plads, Building 321, DK-2800 Kgs. Lyngby | Series | IMM-Thesis-2004-73 | Note | Supervised by Prof. Lars Kai Hansen | Electronic version(s) | [pdf] | BibTeX data | [bibtex] | IMM Group(s) | Intelligent Signal Processing |
|