Paper Abstract and Keywords |
Presentation |
2014-12-12 15:40
[Poster Presentation]
Study on signal to noise ratio estimation based on optimal design of subband voice activity detection Shota Morita (JAIST), Xugang Lu (NICT), Masashi Unoki (JAIST) EA2014-46 |
Abstract |
(in English) |
Estimating the signal-to-noise ratio (SNR) of speech plays an important role in noise reduction and in speech intelligibility prediction based on the speech transmission index (STI). During SNR estimation, voice activity detection (VAD) techniques must be used, explicitly or implicitly, to detect speech and non-speech sections. In most studies, the decision threshold that VAD uses to classify speech and non-speech is fixed during SNR estimation. We argue that fixing the decision threshold for all testing conditions is not optimal for controlling the false acceptance and miss detection rates of speech. In this study, we propose an SNR estimation method that uses a speech and non-speech detection algorithm based on optimizing the trade-off between the speech false acceptance and miss detection rates on a receiver operating characteristic (ROC) curve. Rather than fixing the VAD decision threshold for all SNR conditions, we estimate the optimal decision threshold for each SNR condition from an ROC curve. Thresholds are optimized on subband signals using a large training data set covering various SNR conditions and noise types. After speech and non-speech detection, the SNR is estimated by summing the speech and noise powers over all subbands. Experimental results show that the proposed method is more accurate than classical SNR estimation. |
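The two steps described in the abstract (choosing a VAD threshold from an ROC trade-off, then combining subband powers into an SNR) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `roc_threshold` and `estimate_snr_db`, the cost "false acceptance rate + miss rate", the use of frame power as the VAD score, and the subtraction of the noise power from speech-frame power are all assumptions made for illustration. The abstract does not state the exact criterion or features, and it tunes thresholds per SNR condition, whereas the sketch handles a single condition.

```python
import numpy as np

def roc_threshold(scores, labels):
    """Pick the threshold that minimises false-acceptance rate + miss rate.

    scores: per-frame VAD scores for one subband (hypothetical feature).
    labels: 1 for speech frames, 0 for non-speech frames (training labels).
    The cost function is an assumed stand-in for the paper's trade-off.
    """
    scores = np.asarray(scores, dtype=float)
    speech = np.asarray(labels).astype(bool)
    best_t, best_cost = None, np.inf
    for t in np.unique(scores):          # each candidate is one ROC operating point
        decided = scores >= t
        fa = np.mean(decided[~speech])   # non-speech frames accepted as speech
        miss = np.mean(~decided[speech]) # speech frames rejected
        if fa + miss < best_cost:
            best_t, best_cost = t, fa + miss
    return best_t

def estimate_snr_db(subband_power, thresholds):
    """Estimate a global SNR in dB from per-subband frame powers.

    subband_power: array of shape (n_bands, n_frames).
    thresholds: array of shape (n_bands,), one VAD threshold per subband.
    Assumes frame power itself is the VAD score and that speech-frame
    power is speech plus noise, so the noise power is subtracted.
    """
    subband_power = np.asarray(subband_power, dtype=float)
    thresholds = np.asarray(thresholds, dtype=float)
    is_speech = subband_power >= thresholds[:, None]
    speech_total, noise_total = 0.0, 0.0
    for power, mask in zip(subband_power, is_speech):
        if mask.all() or not mask.any():
            continue                     # skip bands with no speech/noise split
        noise_power = power[~mask].mean()
        speech_total += max(power[mask].mean() - noise_power, 0.0)
        noise_total += noise_power
    if noise_total <= 0.0 or speech_total <= 0.0:
        return float("nan")              # estimate undefined for this input
    return 10.0 * np.log10(speech_total / noise_total)
```

In a setup closer to the one the abstract describes, `roc_threshold` would be run once per subband and per training SNR condition, which would yield the threshold-versus-SNR relationship that the keyword "Threshold-SNR curve" suggests.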
Keyword |
(in English) |
SNR estimation / voice activity detection / optimization of threshold / subband processing / Threshold-SNR curve |
Reference Info. |
IEICE Tech. Rep., vol. 114, no. 358, EA2014-46, pp. 37-42, Dec. 2014. |
Paper # |
EA2014-46 |
Date of Issue |
2014-12-05 (EA) |
ISSN |
Print edition: ISSN 0913-5685 Online edition: ISSN 2432-6380 |
Copyright and reproduction |
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |
Conference Information |
Committee |
EA |
Conference Date |
2014-12-12 - 2014-12-13 |
Place (in English) |
Satellite Plaza of Kanazawa University |
Topics (in English) |
General topics |
Paper Information |
Registration To |
EA |
Conference Code |
2014-12-EA |
Language |
Japanese |
Title (in English) |
Study on signal to noise ratio estimation based on optimal design of subband voice activity detection |
Keyword(1) |
SNR estimation |
Keyword(2) |
voice activity detection |
Keyword(3) |
optimization of threshold |
Keyword(4) |
subband processing |
Keyword(5) |
Threshold-SNR curve |
1st Author's Name |
Shota Morita |
1st Author's Affiliation |
Japan Advanced Institute of Science and Technology (JAIST) |
2nd Author's Name |
Xugang Lu |
2nd Author's Affiliation |
National Institute of Information and Communications Technology (NICT) |
3rd Author's Name |
Masashi Unoki |
3rd Author's Affiliation |
Japan Advanced Institute of Science and Technology (JAIST) |
Speaker |
Author-1 |
Date Time |
2014-12-12 15:40:00 |
Presentation Time |
80 minutes |
Registration for |
EA |
Volume (vol) |
vol.114 |
Number (no) |
no.358 |
Page |
pp.37-42 |
#Pages |
6 |
|