Sign in
Author
|
Conference
|
Journal
|
Organization
|
Year
|
DOI
Look for results that meet for the following criteria:
since
equal to
before
between
and
Search in all fields of study
Limit my searches in the following fields of study
Agriculture Science
Arts & Humanities
Biology
Chemistry
Computer Science
Economics & Business
Engineering
Environmental Sciences
Geosciences
Material Science
Mathematics
Medicine
Physics
Social Science
Multidisciplinary
Keywords
(10)
Anisotropic Diffusion
Emotion Recognition
Empirical Mode Decomposition
Empirical Model
Feature Extraction
Intrinsic Mode Function
Oregon
renyi entropy
Spectral Analysis
mel frequency cepstral coefficient
Subscribe
Academic
Publications
Study of empirical mode decomposition and spectral analysis for stress and emotion classification in natural speech
Study of empirical mode decomposition and spectral analysis for stress and emotion classification in natural speech,10.1016/j.bspc.2010.11.001,Biomedi
Edit
Study of empirical mode decomposition and spectral analysis for stress and emotion classification in natural speech
BibTex
|
RIS
|
RefWorks
Download
Ling He
,
Margaret Lech
,
Namunu C. Maddage
,
Nicholas B. Allen
Two new approaches to the
feature extraction
process for automatic stress and emotion classification in speech are proposed and examined. The first method uses the
empirical model
decomposition (EMD) of speech into intrinsic mode functions (IMF) and calculates the average
Renyi entropy
for the IMF channels. The second method calculates the average spectral energy in the sub-bands of speech spectrograms and can be enhanced by
anisotropic diffusion
filtering of spectrograms. In the second method, three types of sub-bands were examined: critical, Bark and ERB. The performance of the new features was compared with the conventional mel frequency cepstral coefficients (MFCC) method. The modeling and classification process applied the classical GMM and KNN algorithms. The experiments used two databases containing natural speech, SUSAS (annotated with three different levels of stress) and the
Oregon
Research Institute (ORI) data (annotated with five different emotions: neutral, angry, anxious, dysphoric, and happy). For the SUSAS data, the best average recognition rates of 77% were obtained when using spectrogram features calculated within ERB bands and combined with anisotropic filtering. For the ORI data, the best result of 53% was obtained with the same method but without anisotropic filtering. Both the GMM and KNN classifiers showed similar performance. These results indicated that the spectrogram patterns provide promising results in the case of stress recognition, however further improvements are needed in the case of emotion recognition.
Journal:
Biomedical Signal Processing and Control - BIOMED SIGNAL PROCESS CONTROL
, vol. 6, no. 2, pp. 139-146, 2011
DOI:
10.1016/j.bspc.2010.11.001
Cumulative
Annual
View Publication
The following links allow you to view full publications. These links are maintained by other sources not affiliated with Microsoft Academic Search.
(
www.sciencedirect.com
)