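音声スポッタ:人間同士の会話中に音声認識が利用可能な音声入力インタフェース (Speech Spotter: Speech Input Interface Capable of Using Speech Recognition in the Midst of Human-Human Conversation)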
http://hdl.handle.net/2237/10518
Name / File | License | Action |
---|---|---|
48-3-1274.pdf (1.2 MB) | | |
Item type | 学術雑誌論文 / Journal Article(1) |
---|---|
Publication date | 2008-09-12 |
Title | 音声スポッタ:人間同士の会話中に音声認識が利用可能な音声入力インタフェース |
Title language | ja |
Alternative title | Speech Spotter: Speech Input Interface Capable of Using Speech Recognition in the Midst of Human-Human Conversation |
Alternative title language | en |
Authors | 後藤, 真孝 (Goto, Masataka); 北山, 広治 (Kitayama, Koji); 伊藤, 克亘 (Ito, Katunobu); 小林, 哲則 (Kobayashi, Tetsunori) |
Access rights | open access |
Access rights URI | http://purl.org/coar/access_right/c_abf2 |
Rights (ja) | ここに掲載した著作物の利用に関する注意 本著作物の著作権は(社)情報処理学会に帰属します。本著作物は著作権者である情報処理学会の許可のもとに掲載するものです。ご利用に当たっては「著作権法」ならびに「情報処理学会倫理綱領」に従うことをお願いいたします。 |
Rights (en) | Notice for the use of this material: The copyright of this material is retained by the Information Processing Society of Japan (IPSJ). This material is published on this web site with the agreement of the author(s) and the IPSJ. Please comply with the Copyright Law of Japan and the Code of Ethics of the IPSJ if you wish to reproduce, make derivative work, distribute or make available to the public any part or whole thereof. All Rights Reserved, Copyright (C) Information Processing Society of Japan. Comments are welcome. Mail to address: editj<at>ipsj.or.jp, please. |
Abstract (ja) | 本論文では,人間同士の会話中に音声認識システムへ音声コマンドを入力できる「音声スポッタ」という音声インタフェース機能を提案する.従来,会話中のユーザの音声が,音声認識システムと会話相手の人のどちらに対する発話かを,マイク入力による音声だけから識別することは困難だったため,人間同士の会話中に音声認識システムは利用されていなかった.音声スポッタでは,音声に含まれる非言語情報の中から,有声休止(「えー」のように母音の引き延ばし)による言い淀みと,声の高さの2種類を活用することで,各発話が音声認識システムに入力されるかどうかを,ユーザが意図的に制御できるようにする.具体的には,母音を延ばして言い淀んだ後に故意に高い声で発声された特殊な(不自然な)発話だけを音声認識対象と見なし,通常の会話中の発話は無視することで会話の支援を実現する.その応用例として我々は,会話中のユーザに各種情報支援をする「オンデマンド会話支援システム」と,電話での通話中にユーザがBGMを選曲・再生できる「BGM付き電話システム」の2つを構築した.音声スポッタによる発話の検出性能の評価結果やこれらのシステムの試用を通じて,本機能が頑健で便利であることを確認した. |
Abstract (en) | This paper describes a speech-interface function, called "Speech Spotter", which enables a user to enter voice commands into a speech recognizer in the midst of natural human-human conversation. In the past, it has been difficult to use automatic speech recognition in human-human conversation since it was not easy to judge, from only microphone input, whether a user was speaking to another person or a speech recognizer. We enable a user to intentionally control whether each utterance is to be accepted (processed) by the speech recognizer by using two kinds of nonverbal speech information: a filled pause (a vowel-lengthening hesitation like "er...") and voice pitch. Speech Spotter regards a user utterance as a command utterance only when it is uttered with a high pitch just after a filled pause. In other words, this function accepts only this specially designed, unnatural utterance and ignores other normal utterances. By using Speech Spotter, we have built two application systems: an on-demand information system for assisting human-human conversation and a music-playback system for enriching telephone conversation. The results from evaluating this function and using these systems have shown that Speech Spotter is robust and convenient enough to be used in face-to-face or cellular-phone conversations. |
Description type | Abstract |
Publisher | 情報処理学会 (Information Processing Society of Japan) |
Publisher language | ja |
Language | jpn |
Resource type | journal article |
Resource type URI | http://purl.org/coar/resource_type/c_6501 |
Version type | VoR |
Version resource | http://purl.org/coar/version/c_970fb48d4fbd8a85 |
ISSN | 03875806 (PISSN) |
Bibliographic information | ja : 情報処理学会論文誌, vol. 48, no. 3, pp. 1274-1283, issued 2007 |
Format | application/pdf |
Author version flag | publisher |
URI | http://hdl.handle.net/2237/10518 (HDL) |
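
The acceptance rule stated in the abstract (an utterance counts as a command only when it is spoken with a high pitch just after a filled pause) can be illustrated with a small sketch. This is not the authors' implementation: the `Utterance` fields, the `baseline_f0_hz` parameter, and the pitch ratio of 1.3 are hypothetical, and the sketch assumes that filled-pause detection and pitch estimation have already been done upstream.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Utterance:
    """One segmented utterance with precomputed features (hypothetical fields)."""
    text: str
    mean_f0_hz: float               # average voice pitch of the utterance
    preceded_by_filled_pause: bool  # True if a filled pause came just before it


def is_command(u: Utterance, baseline_f0_hz: float, ratio: float = 1.3) -> bool:
    """Accept only utterances that follow a filled pause AND are high-pitched.

    `ratio` is an assumed threshold relative to the speaker's usual pitch;
    the record does not state how "high pitch" is decided.
    """
    high_pitch = u.mean_f0_hz >= baseline_f0_hz * ratio
    return u.preceded_by_filled_pause and high_pitch


def spot_commands(utterances: List[Utterance], baseline_f0_hz: float,
                  ratio: float = 1.3) -> List[str]:
    """Return the text of utterances that would be passed to the recognizer."""
    return [u.text for u in utterances if is_command(u, baseline_f0_hz, ratio)]


if __name__ == "__main__":
    conversation = [
        Utterance("play some jazz", 200.0, True),     # pause + high pitch
        Utterance("see you tomorrow", 125.0, False),  # normal speech
        Utterance("well, maybe", 120.0, True),        # pause, normal pitch
        Utterance("really?!", 210.0, False),          # high pitch, no pause
    ]
    print(spot_commands(conversation, baseline_f0_hz=120.0))
```

Requiring both cues together is what lets ordinary hesitations and ordinary excited speech pass through without being treated as commands.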