口唇動作と音声の共起に着目した被写体と話者の不一致検出 : ニュース映像への適用と評価(萌芽セッション,エンタテインメントのためのメディアとリアリティ)

熊谷, 章吾; 道満, 恵介; 高橋, 友和; 出口, 大輔; 井手, 一郎; 村瀬, 洋; KUMAGAI, Shogo; DOMAN, Keisuke; TAKAHASHI, Tomokazu; DEGUCHI, Daisuke; IDE, Ichiro; MURASE, Hiroshi

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

口唇動作と音声の共起に着目した被写体と話者の不一致検出 : ニュース映像への適用と評価(萌芽セッション,エンタテインメントのためのメディアとリアリティ)

http://hdl.handle.net/2237/23846

名前 / ファイル	ライセンス	アクション
110008726196.pdf (997.2 kB)

Item type

学術雑誌論文 / Journal Article(1)

公開日

2016-03-15

タイトル

口唇動作と音声の共起に着目した被写体と話者の不一致検出 : ニュース映像への適用と評価(萌芽セッション,エンタテインメントのためのメディアとリアリティ)

言語

その他のタイトル

Detection of Inconsistency between Face and Speaker Focusing on the Co-occurrence of Lip Motion and Audio : An Application to News Video and its Evaluation

言語

著者

熊谷, 章吾
道満, 恵介
高橋, 友和
出口, 大輔
井手, 一郎
村瀬, 洋
KUMAGAI, Shogo
DOMAN, Keisuke
TAKAHASHI, Tomokazu
DEGUCHI, Daisuke
IDE, Ichiro
MURASE, Hiroshi

アクセス権

open access

アクセス権URI

http://purl.org/coar/access_right/c_abf2

権利

言語

権利情報

(c)一般社団法人電子情報通信学会本文データは学協会の許諾に基づきCiNiiから複製したものである

キーワード

主題Scheme

Other

主題

発言シーン抽出

キーワード

主題Scheme

Other

主題

視聴覚統合

キーワード

主題Scheme

Other

主題

ニュース映象

キーワード

主題Scheme

Other

主題

口唇動作特徴

キーワード

主題Scheme

Other

主題

speech scene extraction

キーワード

主題Scheme

Other

主題

auditory-visual integration

キーワード

主題Scheme

Other

主題

news video

キーワード

主題Scheme

Other

主題

lip motion feature

抄録

内容記述

ニュース映像中の人物の発言シーンはマルチメディア情報を豊富に含み,資料価値が高い.発言シーンの抽出には顔領域の位置や大きさを利用するアプローチが考えられる.しかし,ナレーションシーンのように被写体と話者が一致していないシーンも存在するため,それだけでは発言シーンを必ずしも抽出できない.そこで我々は,発生する音とそれに伴う口唇動作から得られる複数の音声特徴と画像特徴の相関を利用して被写体と話者の一致・不一致を識別する手法を提案してきた.しかしながら,理想的な環境で撮影した映像に対する評価のみで,実際に放送されるニュース映像に対する評価にとどまっていた.本稿では,理想的な環境で撮影した映像を用いた実験とその結果,および実際に放送されたニュース映像を用いた実験とその結果について報告する.これら2つの実験から,提案手法の有効性および有用性を確認した.

言語

内容記述タイプ

Abstract

抄録

内容記述

Speech scenes in news videos contain a wealth of multimedia information, and are valuable as archived material. In order to extract speech scenes from news videos, there is an approach that uses the position and size of a face region. However, it is difficult to extract them with only the approach, since news videos contain scenes where the speakers are not the subjects such as in narration scenes. To solve this problem, we have been proposing a method to detect the inconsistency between face and speaker focusing on the co-occurrence of the lip motion and the speech. However, the evaluations for the proposed method were performed in an ideal condition without much noise. In this paper, we report the investigation on the performance of the proposed method not only with videos captured in ideal conditions but also with actual broadcasted news videos. Their results showed the effectiveness and the usefulness of our method.

言語

内容記述タイプ

Abstract

内容記述

IEICE Technical Report;MVE2011-12

言語

内容記述タイプ

Other

出版者

言語

出版者

一般社団法人電子情報通信学会

言語

jpn

資源タイプ

資源タイプresource

http://purl.org/coar/resource_type/c_6501

タイプ

journal article

出版タイプ

VoR

出版タイプResource

http://purl.org/coar/version/c_970fb48d4fbd8a85

Versions

Ver.1

2021-03-01 15:15:31.092134

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

口唇動作と音声の共起に着目した被写体と話者の不一致検出 : ニュース映像への適用と評価(萌芽セッション,エンタテインメントのためのメディアとリアリティ)

× 熊谷, 章吾

× 道満, 恵介

× 高橋, 友和

× 出口, 大輔

× 井手, 一郎

× 村瀬, 洋

× KUMAGAI, Shogo

× DOMAN, Keisuke

× TAKAHASHI, Tomokazu

× DEGUCHI, Daisuke

× IDE, Ichiro

× MURASE, Hiroshi

Versions

Share

Cite as

エクスポート