Keywords
(10)
Decision Rule
Discrete Fourier Transform
False Alarm Probability
Probability of Detection
Signal To Noise Ratio
Speech Processing
Statistical Model
Voice Activity Detection
Hidden Markov Model
Likelihood Ratio Test
A modified MAP criterion based on hidden Markov model for voice activity detection
Shiwen Deng, Jiqing Han, Tieran Zheng, Guibin Zheng
The maximum a posteriori (MAP) criterion is widely used in statistical model-based voice activity detection (VAD) approaches. The conventional MAP criterion, however, does not take the inter-frame correlation of voice activity into account. In this paper, we propose a modified MAP criterion based on a two-state hidden Markov model (HMM) that models this inter-frame correlation to improve VAD performance. With the proposed MAP criterion, the decision rule is derived by explicitly incorporating the a priori, a posteriori, and inter-frame correlation information into the likelihood ratio test (LRT). In the LRT, a compensation factor for the hypothesis of speech presence regulates the trade-off between the probability of detection and the false alarm probability. Experimental results show that the VAD algorithm based on the proposed MAP criterion outperforms the one based on the recent conditional MAP (CMAP) criterion under various noise conditions.
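The abstract does not give the decision rule itself, so the following is only a minimal sketch of the general idea: a likelihood ratio test whose prior is propagated through a two-state (noise/speech) HMM. It is not the authors' derivation. The per-bin likelihood ratio assumes the common Gaussian statistical model, and the names `frame_log_lr`, `hmm_vad`, `p_stay_speech`, `p_stay_noise`, and `alpha` (standing in for a compensation factor on the speech hypothesis) are hypothetical, as are all default values.

```python
import math


def frame_log_lr(gamma, xi):
    """Mean log likelihood ratio over frequency bins for one frame.

    gamma: a posteriori SNR per bin; xi: a priori SNR per bin.
    Assumes the Gaussian model, where the per-bin ratio is
    exp(gamma * xi / (1 + xi)) / (1 + xi).
    """
    terms = [g * x / (1.0 + x) - math.log1p(x) for g, x in zip(gamma, xi)]
    return sum(terms) / len(terms)


def hmm_vad(log_lrs, p_stay_speech=0.9, p_stay_noise=0.9,
            alpha=1.0, prior_speech=0.5):
    """Frame-wise speech/non-speech decisions with a two-state HMM prior.

    All parameters are illustrative assumptions, not values from the paper.
    alpha > 1 favors the speech hypothesis (more detections, more false
    alarms); alpha < 1 does the opposite.
    """
    decisions = []
    p_speech = prior_speech
    for llr in log_lrs:
        # Predict: push the previous speech posterior through the transitions.
        pred = p_stay_speech * p_speech + (1.0 - p_stay_noise) * (1.0 - p_speech)
        # Update: posterior log-odds = compensation + evidence + predicted prior.
        log_odds = (math.log(alpha) + llr
                    + math.log(pred) - math.log(1.0 - pred))
        decisions.append(log_odds > 0.0)
        # Clamp before the sigmoid to avoid overflow in exp().
        clamped = max(-50.0, min(50.0, log_odds))
        p_speech = 1.0 / (1.0 + math.exp(-clamped))
    return decisions


if __name__ == "__main__":
    # A weakly negative frame surrounded by strong speech evidence.
    print(hmm_vad([3.0, 3.0, -0.5, 3.0]))
```

In this sketch, a memoryless test (`llr > 0`) would reject the third frame, whereas the HMM prior carried over from the preceding frames keeps it classified as speech; this is the kind of inter-frame correlation the abstract refers to.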
Conference: International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 5220-5223, 2011
DOI: 10.1109/ICASSP.2011.5947534