Clark School Home UMD
ECE

ECE News Story

Espy-Wilson Receives NSF Grant for Robust Speech Recognition

Espy-Wilson Receives NSF Grant for Robust Speech Recognition

Prof. Carol Espy-Wilson
Prof. Carol Espy-Wilson

Professor Carol Espy-Wilson (ECE/ISR) is the principal investigator of a two-year, $339,000 National Science Foundation grant for "Landmark-based Robust Speech Recognition Using Prosody-Guided Models of Speech Variability."

The research will develop a system with performance comparable to humans in automatically transcribing unrestricted conversational speech, representing many speakers and dialects, and embedded in adverse acoustic environments.

Espy-Wilson's approach will apply new high-dimensional machine learning techniques, constrained by empirical and theoretical studies of speech production and perception, to learn from data the information structures that human listeners extract from speech. She will develop large-vocabulary psychologically realistic models of speech acoustics, pronunciation variability, prosody, and syntax by deriving knowledge representations that reflect those proposed for human speech production and speech perception, using machine learning techniques to adjust the parameters of all knowledge representations simultaneously in order to minimize the structural risk of the recognizer.

The team will develop nonlinear acoustic landmark detectors and pattern classifiers that integrate auditory-based signal processing and acoustic phonetic processing, are invariant to noise, change in speaker characteristics and reverberation, and can be learned in a semi-supervised fashion from labeled and unlabeled data. In addition, they will use variable frame rate analysis, which will allow for multi-resolution analysis, as well as implement lexical access based on gesture, using a variety of training data.

The work will improve communication and collaboration between people and machines and also improve understanding of how human produce and perceive speech. It brings together a team of experts in speech processing, acoustic phonetics, prosody, gestural phonology, statistical pattern matching, language modeling, and speech perception, with faculty across engineering, computer science and linguistics.

May 17, 2007


Prev   Next

Current Headlines

Ott, Yorke, and Grebogi recognized by Thompson Reuters as 2016 Citation Laureates in Physics

Professor Khaligh Named Area Editor for the IEEE Transactions on Vehicular Technology

Milchberg and Khaligh Receive 2016 Junior and Senior Faculty Outstanding Research Awards

Prof. Ott, Prof. Yorke & Alumnus Grebogi Named 2016 Thomson Reuters Citation Laureates

ECE Students Advised by Rama Chellappa win Best Poster Award and nVIDIA Best Paper Award at IEEE BTAS 2016

Yeung Receives National Science Foundation Award

Narayan, Zhou, Schlotfeldt, Strahan win ISR outstanding awards

Chellappa to Receive Distinguished Alumnus Award from Indian Institute of Science

News Resources

Return to Newsroom

Search News

Archived News

Events Resources

Events Calendar

Additional Resources

UM Newsdesk

Faculty Experts