2024-03-28T18:33:15Z
https://nagoya.repo.nii.ac.jp/oai
oai:nagoya.repo.nii.ac.jp:00013188
2023-01-16T04:00:12Z
312:313:314
Text-Style Conversion of Speech Transcript into Web Document for Lecture Archive
Ito, Masashi
Ohno, Tomohiro
Matsubara, Shigeki
open access
Copyright (C) 2009 Fuji Technology Press Co,. Ltd.
natural languages
spoken language processing
digital archiving
web contents
paraphrasing
It is very significant to the knowledge society to accumulate spoken documents on the web. However, because of the high redundancy of spontaneous speech, the faithfully transcribed text is not readable on an Internet browser, and therefore not suitable as a web document. This paper proposes a technique for converting spoken documents into web documents for the purpose of building a speech archiving system. The technique edits automatically transcribed texts and improves their readability on the browser. The readable text can be generated by applying technology such as paraphrasing, segmentation, and structuring transcribed texts. Editing experiments using lecture data demonstrated the feasibility of the technique. A prototype system of spoken document archiving was implemented to confirm its effectiveness.
Fuji Technology Press
2009-03-25
eng
journal article
VoR
http://hdl.handle.net/2237/15083
https://nagoya.repo.nii.ac.jp/records/13188
1343-0130
Journal of Advanced Computational Intelligence and Intelligent Informatics
13
4
499
505
https://nagoya.repo.nii.ac.jp/record/13188/files/1110824.pdf
application/pdf
82.3 kB
2018-02-20