On Phonemes as Cognitive Components of Speech |
Ling Feng, Lars Kai Hansen
|
Abstract | COgnitive Component Analysis (COCA) defined as the process of unsupervised grouping of data such that the ensuing group structure is well-aligned with that resulting from human cognitive activity, has been explored on phoneme data. Statistical regularities have been revealed at multiple time scales. The basic features are 25-dimensional short time (20ms) melfrequency
weighted cepstral coefficients. Features are integrated by means of stacking to obtain features at longer time scales. Energy based sparsification is carried out to achieve sparse representations. Our hypothesis is ecological: we assume that features that essentially independent in a context defined ensemble can be efficiently coded using a sparse independent component representation. This means that supervised and unsupervised learning should result in similar representations. We indeed find that supervised and supervised
learning seem to identify similar representations, here, measured by the classification similarity. |
Keywords | Cognitive Component Analysis, Unsupervised Learning, Supervised Learning, Phoneme Classification |
Type | Conference paper [With referee] |
Conference | The 1st IAPR Workshop on Cognitive Information Processing |
Year | 2008 |
Electronic version(s) | [pdf] |
BibTeX data | [bibtex] |
IMM Group(s) | Intelligent Signal Processing |