Keywords: Automatic Speech Recognition, Feature Extraction, Harmonic Analysis, Noise Robustness, Signal Analysis, Speech Recognition, Time Frequency Analysis, Filter Bank, Mel Frequency Cepstral Coefficient, Short Time Fourier Transform
Non-stationary feature extraction for automatic speech recognition
Zoltan Tuske, Pavel Golik, Ralf Schluter, Friedhelm R. Drepper
Current speech recognition systems mainly use features based on the Short-Time Fourier Transform, such as MFCC. Dropping the short-time stationarity assumption for voiced speech, this paper introduces non-stationary signal analysis into the ASR framework. We present new acoustic features extracted by a pitch-adaptive Gammatone filter bank. Noise robustness was demonstrated on the AURORA 2 and AURORA 4 tasks, where the proposed features outperform standard MFCC. Furthermore, successful combination experiments via ROVER indicate differences between the new features and MFCC.
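For readers unfamiliar with Gammatone filter banks, the following is a minimal sketch of the general idea and not the paper's method. It uses the textbook gammatone impulse response with the commonly used ERB bandwidth approximation. Placing the centre frequencies at integer multiples of a pitch estimate `f0` is purely an illustrative assumption about what "pitch-adaptive" could mean; the function names, parameters, and default values are all hypothetical.

```python
import numpy as np

def gammatone_ir(fc, fs, duration=0.05, order=4):
    """Textbook gammatone impulse response centred at fc (Hz), peak-normalised."""
    # Commonly used ERB-based bandwidth; an assumption, not taken from the paper.
    bandwidth = 1.019 * (24.7 + 0.108 * fc)
    t = np.arange(int(duration * fs)) / fs
    ir = t ** (order - 1) * np.exp(-2 * np.pi * bandwidth * t) * np.cos(2 * np.pi * fc * t)
    return ir / np.max(np.abs(ir))

def harmonic_centre_freqs(f0, fs, max_harmonics=20):
    """Hypothetical pitch-adaptive placement: one channel per harmonic of f0 below Nyquist."""
    fc = np.arange(1, max_harmonics + 1) * f0
    return fc[fc < fs / 2]

def filterbank_log_energies(frame, f0, fs):
    """Log energy of each channel output for one frame of audio."""
    feats = []
    for fc in harmonic_centre_freqs(f0, fs):
        y = np.convolve(frame, gammatone_ir(fc, fs), mode="same")
        feats.append(np.log(np.mean(y ** 2) + 1e-12))
    return np.array(feats)

# Usage: a pure 200 Hz tone analysed with an assumed pitch of 100 Hz.
fs = 8000
t = np.arange(int(0.2 * fs)) / fs
tone = np.sin(2 * np.pi * 200 * t)
energies = filterbank_log_energies(tone, f0=100.0, fs=fs)
strongest_channel_hz = harmonic_centre_freqs(100.0, fs)[np.argmax(energies)]
```

In this toy setup the strongest channel is the one centred on the tone's frequency. A real front end would also need a pitch tracker, handling of unvoiced frames, and a decorrelating transform, none of which are shown here.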
Conference: International Conference on Acoustics, Speech, and Signal Processing - ICASSP, pp. 5204-5207, 2011
DOI: 10.1109/ICASSP.2011.5947530