Closed-Form Pre-Training for Small-Sample Environmental Sound Recognition

Nakamasa Inoue; Keita Goto

論文・著書情報

タイトル

和文:
英文:	Closed-Form Pre-Training for Small-Sample Environmental Sound Recognition

著者

和文:	井上中順, Goto Keita.
英文:	Nakamasa Inoue, Keita Goto.

言語

English

掲載誌/書名

和文:
英文:	2020 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)

巻, 号, ページ

pp. 1693-1697

出版年月

2020年12月31日

出版者

和文:
英文:	IEEE

会議名称

和文:
英文:	Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2020（APSIPA ASC）

開催地

和文:
英文:

公式リンク

http://www.apsipa.org/proceedings/2020/APSIPA-ASC-2020.html

アブストラクト

This paper presents a framework for pre-training neural networks, namely closed-form pre-training, and we apply it to small-sample environmental sound recognition. Our main idea is to pre-train neural networks on a dataset automatically gener- ated by some formulas, without any prior real-world recordings or manual annotation. Specifically, the proposed framework consists of two steps. First, an audio classification dataset is generated. Here, we propose three types of dataset definitions using colored noise and its extensions. Second, a network is pre-trained on the generated dataset. The obtained pre-trained network is particularly effective for fine-tuning with few examples because it helps optimization methods avoid falling into a premature local optimal solution. In experiments, we demonstrate the effectiveness of the proposed framework for small-sample environmental sound recognition on three datasets: ESC-10/50, and UrbanSound8K. We obtained performance improvement on all datasets with a small number of training samples.

Home

各種検索

サポート

T2R2について

関連リンク

論文・著書情報