WEKO3
アイテム
Bilingual Speech Dialogue Corpus for Simultaneous Machine Interpretation Research
http://hdl.handle.net/2237/15076
http://hdl.handle.net/2237/15076ce55a5e0-3801-4d7a-bd84-85d17a90b7ac
名前 / ファイル | ライセンス | アクション |
---|---|---|
Oriental_COCOSDA2003_ryu.pdf (209.2 kB)
|
|
Item type | 学術雑誌論文 / Journal Article(1) | |||||
---|---|---|---|---|---|---|
公開日 | 2011-07-15 | |||||
タイトル | ||||||
タイトル | Bilingual Speech Dialogue Corpus for Simultaneous Machine Interpretation Research | |||||
言語 | en | |||||
著者 |
Ryu, Koichiro
× Ryu, Koichiro× Matsubara, Shigeki× Kawaguchi, Nobuo× Inagaki, Yasuyoshi |
|||||
アクセス権 | ||||||
アクセス権 | open access | |||||
アクセス権URI | http://purl.org/coar/access_right/c_abf2 | |||||
抄録 | ||||||
内容記述タイプ | Abstract | |||||
内容記述 | Speech-to-speech translation has been an important research topic with the advance of technologies for speech processing and language processing. This paper describes a bilingual speech dialogue corpus which has been constructed for research on simultaneous machine interpretation at the Center for Integrated Acoustic Information Research (CIAIR), Nagoya University. The corpus has been implemented by collecting simulated cross-lingual conversations between English speech and Japanese speech through simultaneous interpretation, and by transcribing them manually with bilingual sentence alignment. In the year 2002, 216 spoken dialogues have been collected under a real environment, and transcribed into text files consisting of about 300,000 morphemes. In order to utilize the bilingual corpus effectively, every source utterance speech has been segmented into interpreting units according to its word-for-word translation and the word alignment of them. The interpreting unit means a linguistic chunk that could be interpreted separately and simultaneously. This paper has investigated linguistic characters of such the unit, and examined the feasibility of simultaneous machine interpretation. | |||||
言語 | en | |||||
出版者 | ||||||
出版者 | The oriental chapter of COCOSDA (The International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques) | |||||
言語 | en | |||||
言語 | ||||||
言語 | eng | |||||
資源タイプ | ||||||
資源タイプ識別子 | http://purl.org/coar/resource_type/c_6501 | |||||
資源タイプ | journal article | |||||
出版タイプ | ||||||
出版タイプ | AM | |||||
出版タイプResource | http://purl.org/coar/version/c_ab4af688f83e57aa | |||||
書誌情報 |
en : O-COCOSDA 2003 : proceedings of the 6th Oriental COCOSDA workshop in Singapore on Oct 1-3 2003 p. 164-168, 発行日 2003-06-24 |
|||||
著者版フラグ | ||||||
値 | author | |||||
URI | ||||||
識別子 | http://hdl.handle.net/2237/15076 | |||||
識別子タイプ | HDL |