Action Sequence Recognition in Videos by Combining a CTC Network with a Statistical Language Model

Mengxi Lin; Nakamasa Inoue; Koichi Shinoda

論文・著書情報

タイトル

和文:
英文:	Action Sequence Recognition in Videos by Combining a CTC Network with a Statistical Language Model

著者

和文:	Lin Mengxi, 井上中順, 篠田浩一.
英文:	Mengxi Lin, Nakamasa Inoue, Koichi Shinoda.

言語

English

掲載誌/書名

和文:
英文:	Technical Reports of IEICE PRMU

巻, 号, ページ

vol. 117 no. 362 pp. 1-6

出版年月

2017年12月16日

出版者

和文:	電子情報通信学会
英文:

会議名称

和文:
英文:	Pattern Recognition and Media Understanding (PRMU) 2017-12

開催地

和文:
英文:	慶應義塾大学理工学部矢上キャンパス

ファイル

公式リンク

http://www.ieice.org/ken/paper/20171217m1A7/
http://www.ieice.org/ken/program/index.php?tgs_regid=dcb3f5da17b46802a7ac54d2b097a59876c6cb3ce4786e95505fbc3a90fe24d9&tgid=IEICE-PRMU&lang=

アブストラクト

Action sequence recognition aims to recognize what actions occur in a video and their temporal order. In this paper, we propose to combine an LSTM network trained with Connectionist Temporal Classification (CTC) with a statistical language model for action sequence recognition. The statistical language model captures the relations between action instances, which are hardly learned by the CTC network. Our experiments on the Breakfast dataset show that the statistical language model can significantly boost the recognition accuracy of the CTC network, from 37.0% to 43.4%.

Home

各種検索

サポート

T2R2について

関連リンク

論文・著書情報