The University of Maryland Department of Electrical and Computer Engineering

Search
 
» INFO FOR:   Prospective Students | Current Students | Alumni | Industry & Government | Faculty & Staff | Family | Media
 
 
 
 
 
 
 
 
 
  The A. James Clark School of Engineering

Join our group on LinkedIn
Follow us on Twitter
Follow Us on Facebook
Directory

ECE Google Apps Mail

ECE Web VPN

Help Desk

Technical Operations

University Libraries

ECE Site Feedback


Give to ECE: Great Expectations Campaign





ECE Spotlight on Research



Landmark-Based Robust Speech Recognition Using Prosody-Guided Models of Speech Variability
Prof. Carol Espy-Wilson
Dr. Carol Espy-Wilson
Dr. Carol Espy-Wilson

The research will develop a system with performance comparable to humans in automatically transcribing unrestricted conversational speech, representing many speakers and dialects, and embedded in adverse acoustic environments.

Prof. Carol Espy-Wilson's approach will apply new high-dimensional machine learning techniques, constrained by empirical and theoretical studies of speech production and perception, to learn from data the information structures that human listeners extract from speech. She will develop large-vocabulary psychologically realistic models of speech acoustics, pronunciation variability, prosody, and syntax by deriving knowledge representations that reflect those proposed for human speech production and speech perception, using machine learning techniques to adjust the parameters of all knowledge representations simultaneously in order to minimize the structural risk of the recognizer.

The team will develop nonlinear acoustic landmark detectors and pattern classifiers that integrate auditory-based signal processing and acoustic phonetic processing, are invariant to noise, change in speaker characteristics and reverberation, and can be learned in a semi-supervised fashion from labeled and unlabeled data. In addition, they will use variable frame rate analysis, which will allow for multi-resolution analysis, as well as implement lexical access based on gesture, using a variety of training data.

The work will improve communication and collaboration between people and machines and also improve understanding of how human produce and perceive speech. It brings together a team of experts in speech processing, acoustic phonetics, prosody, gestural phonology, statistical pattern matching, language modeling, and speech perception, with faculty across engineering, computer science and linguistics.

Prof. Espy-Wilson is the principal investigator of the "Landmark-Based Robust Speech Recognition Using Prosody-Guided Models of Speech Variability" research project, which gained support through a two-year, $339,000 National Science Foundation grant.

return to spotlight on research

Content on this page requires a newer version of Adobe Flash Player.

Get Adobe Flash player

Content on this page requires a newer version of Adobe Flash Player.

Get Adobe Flash player

Content on this page requires a newer version of Adobe Flash Player.

Get Adobe Flash player

 

↑ Back to Top



© Copyright 2005-2013, University of Maryland
University of Maryland A. James Clark School of Engineering Department of Electrical and Computer Engineering