Paper Abstract and Keywords |
Presentation |
2014-05-25 11:30
Modulation transfer function based robust method of voice activity detection for noisy reverberant environments
-- Utilization of subband SNR estimation -- Shota Morita, Masashi Unoki (JAIST), Xugang Lu (NICT), Masato Akagi (JAIST) SP2014-41 |
Abstract |
(in Japanese) |
(See Japanese page) |
(in English) |
Most of the current voice activity detection (VAD) algorithms deal with clean speech or additive noisy speech. However, the performance in noisy reverberant environments are poor if they do, because speech is smeared due to the effects of noise and reverberation. This paper proposes a robust VAD algorithm for precisely detecting speech and non-speech periods in noisy reverberant environments. The proposed VAD algorithm consists of three processes. The first process is an estimation of the signal to noise ratio (SNR) which is used to mitigate the additive noise effect on the power envelope of speech. The second process is a power envelope dereverberation based on the concept of modulation transfer function. The final process is a threshold processing on the dereverberated power envelope of speech for speech/non-speech decision. Proposed VAD and conventional VAD methods are evaluated the detection performance of speech/non-speech in both artificial and realistic noisy reverberant environments. As the results, the proposed VAD algorithm significantly outperforms the conventional VAD algorithms. |
Keyword |
(in Japanese) |
(See Japanese page) |
(in English) |
voice activity detection / modulation transfer function / noisy reverberant environments / signal to noise ratio estimation / power envelope restoration / / / |
Reference Info. |
IEICE Tech. Rep., vol. 114, no. 52, SP2014-41, pp. 383-388, May 2014. |
Paper # |
SP2014-41 |
Date of Issue |
2014-05-17 (SP) |
ISSN |
Print edition: ISSN 0913-5685 Online edition: ISSN 2432-6380 |
Copyright and reproduction |
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |
Download PDF |
SP2014-41 |
Conference Information |
Committee |
SP IPSJ-MUS |
Conference Date |
2014-05-24 - 2014-05-25 |
Place (in Japanese) |
(See Japanese page) |
Place (in English) |
|
Topics (in Japanese) |
(See Japanese page) |
Topics (in English) |
|
Paper Information |
Registration To |
SP |
Conference Code |
2014-05-SP-MUS |
Language |
Japanese |
Title (in Japanese) |
(See Japanese page) |
Sub Title (in Japanese) |
(See Japanese page) |
Title (in English) |
Modulation transfer function based robust method of voice activity detection for noisy reverberant environments |
Sub Title (in English) |
Utilization of subband SNR estimation |
Keyword(1) |
voice activity detection |
Keyword(2) |
modulation transfer function |
Keyword(3) |
noisy reverberant environments |
Keyword(4) |
signal to noise ratio estimation |
Keyword(5) |
power envelope restoration |
Keyword(6) |
|
Keyword(7) |
|
Keyword(8) |
|
1st Author's Name |
Shota Morita |
1st Author's Affiliation |
Japan Advanced Institute of Science and Technology (JAIST) |
2nd Author's Name |
Masashi Unoki |
2nd Author's Affiliation |
Japan Advanced Institute of Science and Technology (JAIST) |
3rd Author's Name |
Xugang Lu |
3rd Author's Affiliation |
National Institute of Information and Communications Technology (NICT) |
4th Author's Name |
Masato Akagi |
4th Author's Affiliation |
Japan Advanced Institute of Science and Technology (JAIST) |
5th Author's Name |
|
5th Author's Affiliation |
() |
6th Author's Name |
|
6th Author's Affiliation |
() |
7th Author's Name |
|
7th Author's Affiliation |
() |
8th Author's Name |
|
8th Author's Affiliation |
() |
9th Author's Name |
|
9th Author's Affiliation |
() |
10th Author's Name |
|
10th Author's Affiliation |
() |
11th Author's Name |
|
11th Author's Affiliation |
() |
12th Author's Name |
|
12th Author's Affiliation |
() |
13th Author's Name |
|
13th Author's Affiliation |
() |
14th Author's Name |
|
14th Author's Affiliation |
() |
15th Author's Name |
|
15th Author's Affiliation |
() |
16th Author's Name |
|
16th Author's Affiliation |
() |
17th Author's Name |
|
17th Author's Affiliation |
() |
18th Author's Name |
|
18th Author's Affiliation |
() |
19th Author's Name |
|
19th Author's Affiliation |
() |
20th Author's Name |
|
20th Author's Affiliation |
() |
Speaker |
Author-1 |
Date Time |
2014-05-25 11:30:00 |
Presentation Time |
240 minutes |
Registration for |
SP |
Paper # |
SP2014-41 |
Volume (vol) |
vol.114 |
Number (no) |
no.52 |
Page |
pp.383-388 |
#Pages |
6 |
Date of Issue |
2014-05-17 (SP) |