Single-Channel Multiple Regression for In-Car Speech Enhancement

LI, Weifeng; ITOU, Katsunobu; TAKEDA, Kazuya; ITAKURA, Fumitada


http://hdl.handle.net/2237/15051

File

430.pdf (519.3 kB)

Item type

Journal Article

Release date

2011-07-06

Title

Single-Channel Multiple Regression for In-Car Speech Enhancement

Authors

LI, Weifeng
ITOU, Katsunobu
TAKEDA, Kazuya
ITAKURA, Fumitada

Access rights

open access

Access rights URI

http://purl.org/coar/access_right/c_abf2

Keywords (subject scheme: Other)

speech enhancement
speech recognition
multi-layer perceptron
mean opinion score
pairwise preference test
environmental adaptation
K-means clustering

Abstract

We address the problem of improving hands-free speech enhancement and speech recognition performance in different car environments using a single distant microphone. This paper describes a new single-channel in-car speech enhancement method that estimates the log spectra of speech at a close-talking microphone by nonlinear regression on the log spectra of the noisy signal captured by a distant microphone and of the estimated noise. In our subjective evaluation, the regression-enhanced speech shows significant overall quality improvements, and the proposed method performs best on most objective measures. In isolated word recognition experiments conducted under 15 real car environments, the proposed adaptive nonlinear regression approach achieves average relative word error rate (WER) reductions of 50.8% and 13.1% compared to the original noisy speech and the ETSI advanced front-end (ETSI ES 202 050), respectively.
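The regression described in the abstract can be illustrated with a small sketch. It is not the authors' implementation: it assumes a single-hidden-layer perceptron as the nonlinear regressor (the keywords mention a multi-layer perceptron, but the architecture, layer sizes, and learning rate here are illustrative choices), and it trains on synthetic log spectra in which the noisy power is simply the sum of clean and noise power. All function names are hypothetical.

```python
import numpy as np

def make_features(log_noisy, log_noise):
    # Per-frame input: log spectrum of the noisy signal plus the noise estimate.
    return np.concatenate([log_noisy, log_noise], axis=1)

def init_mlp(n_in, n_hidden, n_out, rng):
    # Small random weights, zero biases (illustrative initialisation).
    return {
        "W1": rng.normal(0.0, 0.1, (n_in, n_hidden)),
        "b1": np.zeros(n_hidden),
        "W2": rng.normal(0.0, 0.1, (n_hidden, n_out)),
        "b2": np.zeros(n_out),
    }

def forward(params, x):
    # tanh hidden layer, linear output layer (estimated clean log spectrum).
    h = np.tanh(x @ params["W1"] + params["b1"])
    y = h @ params["W2"] + params["b2"]
    return h, y

def train_step(params, x, target, lr):
    # One full-batch gradient-descent step on the squared error;
    # returns the loss measured before the update.
    h, y = forward(params, x)
    err = y - target
    n = x.shape[0]
    loss = float(np.mean(np.sum(err ** 2, axis=1)))
    grad_y = 2.0 * err / n
    g_w2 = h.T @ grad_y
    g_b2 = grad_y.sum(axis=0)
    grad_h = (grad_y @ params["W2"].T) * (1.0 - h ** 2)
    g_w1 = x.T @ grad_h
    g_b1 = grad_h.sum(axis=0)
    params["W1"] -= lr * g_w1
    params["b1"] -= lr * g_b1
    params["W2"] -= lr * g_w2
    params["b2"] -= lr * g_b2
    return loss

def demo(n_frames=200, n_bins=8, n_hidden=16, steps=300, lr=0.01):
    # Synthetic data only: random log spectra, not real in-car recordings.
    rng = np.random.default_rng(0)
    log_clean = rng.normal(size=(n_frames, n_bins))
    log_noise = rng.normal(size=(n_frames, n_bins))
    # Assumption: noisy power = clean power + noise power.
    log_noisy = np.logaddexp(log_clean, log_noise)
    x = make_features(log_noisy, log_noise)
    params = init_mlp(x.shape[1], n_hidden, n_bins, rng)
    return [train_step(params, x, log_clean, lr) for _ in range(steps)]

losses = demo()
print("first loss:", losses[0], "last loss:", losses[-1])
```

In this toy setting the regression target is the clean log spectrum itself; in the paper's setting the target would be the log spectrum observed at the close-talking microphone.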

Description type

Abstract

Publisher

Institute of Electronics, Information and Communication Engineers

Language

eng

Resource type URI

http://purl.org/coar/resource_type/c_6501

Resource type

journal article

Publication type

VoR

Publication type URI

http://purl.org/coar/version/c_970fb48d4fbd8a85

Versions

Ver.1

2021-03-01 18:37:12.356409
