Committee |
Date Time |
Place |
Paper Title / Authors |
Abstract |
Paper # |
WIT, HI-SIGACI |
2021-12-09 13:25 |
Online |
Online |
Acceptability of Distributed Audio Descriptions Complementing Live TV Sports Programs through Practical Experiment Manon Ichiki, Masaru Miyazaki (NHK), Atsushi Imai, Tohru Takagi (NHK-ES) WIT2021-37 |
We are conducting research with the aim of realizing a new type of audio descriptions with the aim of further enhancing ... [more] |
WIT2021-37 pp.28-33 |
NLC, IPSJ-NL, SP, IPSJ-SLP [detail] |
2021-12-03 11:00 |
Online |
Online |
Multi-speaker Audiobook Speech Synthesis using Discrete Character Acting Styles Acquired by VQVAE Wataru Nakata, Tomoki Koriyama, Shinnosuke Takamichi, Yuki Saito (UT), Yusuke Ijima, Ryo Masumura (NTT), Hiroshi Saruwatari (UT) NLC2021-26 SP2021-47 |
In this paper, we propose a method of extracting discrete character acting styles using vector quantized variational aut... [more] |
NLC2021-26 SP2021-47 pp.42-47 |
EMM, EA, ASJ-H |
2021-11-15 09:00 |
Online |
Online |
[Poster Presentation]
Speech Manipulation Detection Method Using Audio Watermarking
-- Frame Synchronization Method -- Kota Muroi, Kazuhiro Kondo (Yamagata Univ.) EA2021-33 EMM2021-60 |
The tampering detection method using digital watermarking in interrogation audio has a problem that many false positives... [more] |
EA2021-33 EMM2021-60 pp.37-42 |
EMM, EA, ASJ-H |
2021-11-15 13:30 |
Online |
Online |
[Poster Presentation]
Playback Sound Degradation of Digital Audio Equipment Caused by High Frequency Electromagnetic Noise Applied via USB Yuki Matsuo, Takahiro Yoshida (Tokyo Univ. of Science) EA2021-41 EMM2021-68 |
In recent years, PC (personal computer) audio has become popular and the sound quality of digital audio equipment has be... [more] |
EA2021-41 EMM2021-68 pp.80-84 |
EA, ASJ-H |
2021-08-20 16:25 |
Online |
Online |
The effect of realization model on the spatial accuracy of binaural synthesis based on spatial function Wataru Takemoto, Shuichi Sakamoto (Tohoku Univ.) EA2021-26 |
In this paper, we investigated the effect of the realization method on the accuracy of sound space synthesized by the bi... [more] |
EA2021-26 pp.33-38 |
CQ |
2021-08-04 15:55 |
Online |
Online |
The Effect of Content Store Size on QoE of Video and Audio Transmission in ICN/CCN Keisuke Kobayashi, Toshiro Nunome (Nagoya Inst. of Tech) CQ2021-35 |
In this paper, we consider H.264 video and audio transmission in ICN (Information-Centric Networking) / CCN (Content-Cen... [more] |
CQ2021-35 pp.70-74 |
CQ |
2021-08-04 16:20 |
Online |
Online |
Comparison of Reliable Transmission Schemes using Retransmission and AL-FEC on Audiovisual Groupcast over Wireless LANs Minoru Ishida, Toshiro Nunome (Nagoya Inst. of Tech) CQ2021-36 |
In this report, we compare QoS and QoE of reliable transmission methods using retransmission and that using AL-FEC on au... [more] |
CQ2021-36 pp.75-80 |
EA, ASJ-H |
2021-07-16 13:00 |
Online |
Online |
Sound event detection based on complementary-label learning Keigo Wakayama, Shoichiro Saito (NTT) EA2021-17 |
Sound Event Detection (SED) is an important research field that can be applied to smart cities, and etc. SED estimate th... [more] |
EA2021-17 pp.77-82 |
SP, IPSJ-SLP, IPSJ-MUS |
2021-06-18 15:00 |
Online |
Online |
A Beginner's Introduction to Sound Programming for Digital Stomp Boxes Naofumi Aoki (Hokkaido Univ.) SP2021-3 |
This study has developed a platform for programmable digital stomp boxes. This paper introduces the overview of our prod... [more] |
SP2021-3 pp.13-18 |
SP, IPSJ-SLP, IPSJ-MUS |
2021-06-18 15:00 |
Online |
Online |
Protection method with audio processing against Audio Adversarial Example Taisei Yamamoto, Yuya Tarutani, Yukinobu Fukusima, Tokumi Yokohira (Okayama Univ) SP2021-4 |
Machine learning technology has improved the recognition accuracy of voice recognition, and demand for voice recognition... [more] |
SP2021-4 pp.19-24 |
WIT |
2021-06-01 14:55 |
Online |
Online |
The relationship between speech rate and environmental noise in synthesized speech for easy listening of movie audio discription Takeya Naono, Sawako Nakajima, Kazutaka Mitobe (Akita Univ) WIT2021-8 |
In recent years, speech synthesis has been used for audio description of movies and videos, and there is a need to impro... [more] |
WIT2021-8 pp.38-42 |
EMM, IT |
2021-05-21 09:30 |
Online |
Online |
An Audio Data Hiding Scheme Utilizing Artificial Flowing Water Sounds Naoyuki Muraoka, Tetsuya Kojima, Raito Matsuzaki (NIT, Tokyo College) IT2021-7 EMM2021-7 |
Data hiding techniques can be used as a communication medium as well as digital watermarking or fingerprinting applicati... [more] |
IT2021-7 EMM2021-7 pp.37-41 |
SC |
2021-03-19 15:45 |
Online |
Online |
Proposal for a personalized adaptive speaker service to support the elderly at home Takumi Akashi, Sachio Saiki, Masahide Nakamura (Kobe Univ.), Kiyoshi Yasuda (OIT) SC2020-41 |
In this study, we aim to realize an assistive technology that can present necessary information to elderly people with c... [more] |
SC2020-41 pp.49-54 |
EA, US, SP, SIP, IPSJ-SLP [detail] |
2021-03-03 14:05 |
Online |
Online |
[Poster Presentation]
Issues on automatic soundscape generation based on image object detection Yoshifumi Chisaki (CIT), Toshiharu Horiuchi (KDDI Research, Inc.) EA2020-66 SIP2020-97 SP2020-31 |
This study describes automatic soundscape generation process for non-audio movie and photo.
The processes consists of ... [more] |
EA2020-66 SIP2020-97 SP2020-31 pp.41-44 |
EA, US, SP, SIP, IPSJ-SLP [detail] |
2021-03-04 10:15 |
Online |
Online |
A quantitative measure of discriminability between NMF dictionaries Eisuke Konno, Daisuke Saito, Nobuaki Minematsu (UTokyo) EA2020-82 SIP2020-113 SP2020-47 |
Supervised nonnegative matrix factorization (NMF) is a popular approach for monaural audio source separation. It realize... [more] |
EA2020-82 SIP2020-113 SP2020-47 pp.134-139 |
EA, US, SP, SIP, IPSJ-SLP [detail] |
2021-03-04 16:45 |
Online |
Online |
Evaluation of Attention Fusion based Audio-Visual Target Speaker Extraction on Real Recordings Hiroshi Sato, Tsubasa Ochiai, Keisuke Kinoshita, Marc Delcroix, Tomohiro Nakatani, Shoko Araki (NTT) EA2020-88 SIP2020-119 SP2020-53 |
The audio-visual target speech extraction, which aims at extracting a target speaker's voice from a mixture with audio a... [more] |
EA2020-88 SIP2020-119 SP2020-53 pp.170-175 |
WIT, ASJ-H |
2021-02-05 13:30 |
Online |
Online |
Development of Audiovisual Learning Materials of Musculoskeletal System Reiya Sato, Takeaki Shionome (Teikyo Univ.) WIT2020-23 |
In this paper, we develop a smartphone application for learning musculoskeletal knowledge, which is essential for passin... [more] |
WIT2020-23 pp.1-4 |
CAS, ICTSSL |
2021-01-29 15:45 |
Online |
Online |
Reproduction of Japanese drumming rhythm by Deep Neural Network(DNN) Kazumi Okamoto, Hiroshi Tamura (Chuo Univ.) CAS2020-69 ICTSSL2020-54 |
Recently, artificial intelligence has been applied to music, such as automatic music generation. In this paper, we propo... [more] |
CAS2020-69 ICTSSL2020-54 pp.158-161 |
EA |
2020-12-14 09:40 |
Online |
Online |
Sound source separation method for use in live concerts Ryotaro Yamada, Kota Takahashi (UEC) EA2020-47 |
In live concerts, musical instrument sounds are mixed at vocal microphone, which makes it difficult to mix properly. To ... [more] |
EA2020-47 pp.7-12 |
SIP |
2020-08-28 10:30 |
Online |
Online |
Improvement Convergence Rate of the Sign Algorithm by Natural Gradient Method Taiyo Mineo, Hayaru Shouno (UEC) SIP2020-34 |
In lossless audio compression, it is essential to predictive residuals to be sparse, since we apply entropy codings to r... [more] |
SIP2020-34 pp.19-24 |