TR-2007-07
A Probabilistic Speech Recognition Framework Based on the Temporal
Dynamics of Distinctive Feature Landmark Detectors
Aren Jansen; Partha Niyogi. 1 June, 2007.
Communicated by Partha Niyogi.
Abstract
In this paper, we elaborate on a computational model for speech
recognition that is inspired by several different interrelated strands
of research in phonology, acoustic phonetics, speech perception, and
neuroscience. Our goals are twofold: (i) to explore frameworks for
recognition that may provide a viable alternative to the current hidden
Markov model (HMM) based speech recognition systems (ii) to provide a
computational platform that will allow us to engage, quantify, and test
various theories in the scientific traditions in phonetics, psychology,
and neuroscience. Our approach uses the notion of distinctive features,
constructs a hierarchically structured point process representation
based on feature detectors, and probabilistically integrates the firing
patterns of these detectors to decode a phonetic sequence. We find the
accuracy of a broad class recognizer based on this framework to be
competitive with equivalent HMM-based systems. We conclude by outlining
various avenues for future development of our methodology.
Original Document
The original document is available in PDF (uploaded 1 June, 2007 by
Partha Niyogi).