WEKO3
Item
Single-Channel Multiple Regression for In-Car Speech Enhancement
http://hdl.handle.net/2237/15051
Name / File | License | Action |
---|---|---|
430.pdf (519.3 kB) | | |
Item type | Journal Article(1) |
---|---|
Publication date | 2011-07-06 |
Title | |
Title | Single-Channel Multiple Regression for In-Car Speech Enhancement |
Language | en |
Authors | LI, Weifeng; ITOU, Katsunobu; TAKEDA, Kazuya; ITAKURA, Fumitada |
Access rights | |
Access rights | open access |
Access rights URI | http://purl.org/coar/access_right/c_abf2 |
Rights | |
Language | en |
Rights information | Copyright (C) 2006 IEICE |
Keyword | |
Subject scheme | Other |
Subject | speech enhancement |
Keyword | |
Subject scheme | Other |
Subject | speech recognition |
Keyword | |
Subject scheme | Other |
Subject | multi-layer perceptron |
Keyword | |
Subject scheme | Other |
Subject | mean opinion score |
Keyword | |
Subject scheme | Other |
Subject | pairwise preference test |
Keyword | |
Subject scheme | Other |
Subject | environmental adaptation |
Keyword | |
Subject scheme | Other |
Subject | K-means clustering |
Abstract | |
Description | We address issues for improving hands-free speech enhancement and speech recognition performance in different car environments using a single distant microphone. This paper describes a new single-channel in-car speech enhancement method that estimates the log spectra of speech at a close-talking microphone based on the nonlinear regression of the log spectra of noisy signal captured by a distant microphone and the estimated noise. The proposed method provides significant overall quality improvements in our subjective evaluation on the regression-enhanced speech, and performed best in most objective measures. Based on our isolated word recognition experiments conducted under 15 real car environments, the proposed adaptive nonlinear regression approach shows an advantage in average relative word error rate (WER) reductions of 50.8% and 13.1%, respectively, compared to original noisy speech and ETSI advanced front-end (ETSI ES 202 050). |
Language | en |
Description type | Abstract |
Publisher | |
Language | en |
Publisher | Institute of Electronics, Information and Communication Engineers |
Language | |
Language | eng |
Resource type | |
Resource type URI | http://purl.org/coar/resource_type/c_6501 |
Type | journal article |
Version type | |
Version type | VoR |
Version type URI | http://purl.org/coar/version/c_970fb48d4fbd8a85 |
Related information | |
Relation type | isVersionOf |
Identifier type | URI |
Related identifier | http://www.ieice.org/jpn/trans_online/index.html |
ISSN | |
Source identifier type | PISSN |
Source identifier | 0916-8532 |
Bibliographic information | en : IEICE transactions on information and systems, Vol. E89-D, No. 3, pp. 1032-1039, issue date 2006-03-01 |
Author version flag | |
Value | publisher |
URI | |
Identifier | http://www.ieice.org/jpn/trans_online/index.html |
Identifier type | URI |
URI | |
Identifier | http://hdl.handle.net/2237/15051 |
Identifier type | HDL |