講演のリアルタイム字幕生成のための逐次的な改行挿入
講演のリアルタイム字幕生成のための逐次的な改行挿入
カテゴリ: 論文誌(論文単位)
グループ名: 【C】電子・情報・システム部門
発行日: 2013/02/01
タイトル(英語): Sequential Linefeed Insertion into Lecture Transcription for Real-time Captioning
著者名: 大野 誠寛(名古屋大学情報基盤センター),村田 匡輝(名古屋大学大学院情報科学研究科),松原 茂樹(名古屋大学大学院情報科学研究科)
著者名(英語): Tomohiro Ohno (Information Technology Center, Nagoya University), Masaki Murata (Graduate School of Information Science, Nagoya University), Shigeki Matsubara (Graduate School of Information Science, Nagoya University)
キーワード: 音声言語,節境界,係り受け構造,音声コーパス,情報保障 spoken language,clause boundary,dependency structure,speech corpus,information support
要約(英語): To generate readable captions for Japanese spoken monologue such as lectures in real time, it is necessary to sequentially display the caption into which proper linefeeds are inserted. This paper proposes a technique for sequentially inserting proper linefeeds into the lecture transcript whenever a bunsetsu, which is a linguistic unit shorter than a sentence in Japanese that roughly corresponds to a basic phrase in English, is identified. Assuming that linefeeds are inserted into bunsetsu boundaries, the technique can make the delay time of captioning shortest. The technique statistically judges whether or not a linefeed should be inserted into each bunsetsu boundary by using the information which is available at the time. We conducted experiments on linefeed insertion using a Japanese lecture corpus. From the experimental results, we confirmed that our method, which is bunsetsu-based linefeed insertion method, had almost as much accuracy as the sentence-based linefeed insertion method. In addition, we conducted the comparative evaluations with four baseline methods. As the results, we confirmed that our method could insert linefeeds more properly than the simple methods which are thought to have as same delay time as our method.
本誌: 電気学会論文誌C(電子・情報・システム部門誌) Vol.133 No.2 (2013) 特集:省電力時代の電子回路技術
本誌掲載ページ: 418-426 p
原稿種別: 論文/日本語
電子版へのリンク: https://www.jstage.jst.go.jp/article/ieejeiss/133/2/133_418/_article/-char/ja/
受取状況を読み込めませんでした
