2024-03-28T11:51:24Z
https://nagoya.repo.nii.ac.jp/oai
oai:nagoya.repo.nii.ac.jp:00013310
2023-01-16T04:00:24Z
312:313:314
Construction of linefeed insertion rules for lecture transcript and their evaluation
Murata, Masaki
41953
Ohno, Tomohiro
41954
Matsubara, Shigeki
41955
spoken language
sentence analysis
real-time captioning
clause boundaries
speech corpus
linefeed insertion rules
The development of a captioning system that supports the real-time understanding of monologue speech such as lectures and commentaries is required. In monologues, since a sentence tends to be long, each sentence is often displayed in multi lines on the screen. In the case, it is necessary to insert linefeeds into a text so that the text becomes easy to read. This paper proposes a rule-based technique for inserting linefeeds into a Japanese spoken monologue sentence as an elemental technique to generate the readable captions. Our method inserts linefeeds into a sentence by applying the rules based on morphemes, dependencies and clause boundaries. We established the rules by circumstantially investigating the corpus annotated with linefeeds. An experiment using Japanese monologue corpus has shown the effectiveness of our rules.
journal article
Inderscience
2010
application/pdf
International Journal of Knowledge and Web Intelligence
3-4
1
227
242
http://dx.doi.org/10.1504/IJKWI.2010.034189
http://hdl.handle.net/2237/15206
1755-8255
https://nagoya.repo.nii.ac.jp/record/13310/files/murata_Inderscience_IJKWI.pdf
eng
https://doi.org/10.1504/IJKWI.2010.034189
[International Journal of Knowledge and Web Intelligence. 1(3-4)/2010] [http://dx.doi.org/10.1504/IJKWI.2010.034189](c)Inderscience Enterprises Ltd.