Paper Abstract and Keywords |
Presentation |
2010-01-21 11:40
Online speaker clustering using an ergodic HMM and its application to meeting minute generation
Takafumi Koshinaka, Kentaro Nagatomo, Kenji Satoh (NEC Corp.)
CQ2009-62 PRMU2009-161 SP2009-102 MVE2009-84 |
Abstract |
(in Japanese) |
(See Japanese page) |
(in English) |
A novel online speaker clustering method suitable for real-time applications is proposed. Using an ergodic hidden Markov model (HMM), it employs incremental learning based on a variational Bayesian framework and provides probabilistic (non-deterministic) decisions for each input utterance, directly considering the specific history of preceding utterances. This enables more robust cluster estimation and more precise utterance classification than conventional online methods achieve. Experiments on meeting-speech data show that the proposed method produces 60-80% fewer classification errors than a conventional method. They also show that it reduces speech recognition errors when combined with unsupervised speaker adaptation. |
Keyword |
(in Japanese) |
(See Japanese page) |
(in English) |
Hidden Markov models / variational Bayesian inference / model selection / meeting speech recognition |
Reference Info. |
IEICE Tech. Rep., vol. 109, no. 375, SP2009-102, pp. 39-44, Jan. 2010. |
Paper # |
SP2009-102 |
Date of Issue |
2010-01-14 (CQ, PRMU, SP, MVE) |
ISSN |
Print edition: ISSN 0913-5685 Online edition: ISSN 2432-6380 |
Copyright and reproduction |
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |
Conference Information |
Committee |
PRMU SP MVE CQ |
Conference Date |
2010-01-21 - 2010-01-22 |
Place (in Japanese) |
(See Japanese page) |
Place (in English) |
Kyoto Univ. |
Topics (in Japanese) |
(See Japanese page) |
Paper Information |
Registration To |
SP |
Conference Code |
2010-01-PRMU-SP-MVE-CQ |
Language |
Japanese |
Title (in Japanese) |
(See Japanese page) |
Sub Title (in Japanese) |
(See Japanese page) |
Title (in English) |
Online speaker clustering using an ergodic HMM and its application to meeting minute generation |
Keyword(1) |
Hidden Markov models |
Keyword(2) |
variational Bayesian inference |
Keyword(3) |
model selection |
Keyword(4) |
meeting speech recognition |
1st Author's Name |
Takafumi Koshinaka |
1st Author's Affiliation |
Common Platform Software Res. Labs., NEC Corporation (NEC Corp.) |
2nd Author's Name |
Kentaro Nagatomo |
2nd Author's Affiliation |
Common Platform Software Res. Labs., NEC Corporation (NEC Corp.) |
3rd Author's Name |
Kenji Satoh |
3rd Author's Affiliation |
Common Platform Software Res. Labs., NEC Corporation (NEC Corp.) |
Speaker |
Author-1 |
Date Time |
2010-01-21 11:40:00 |
Presentation Time |
30 minutes |
Registration for |
SP |
Paper # |
CQ2009-62, PRMU2009-161, SP2009-102, MVE2009-84 |
Volume (vol) |
vol.109 |
Number (no) |
no.373(CQ), no.374(PRMU), no.375(SP), no.376(MVE) |
Page |
pp.39-44 |
#Pages |
6 |
|