2024-03-29T08:52:09Z
https://nagoya.repo.nii.ac.jp/oai
oai:nagoya.repo.nii.ac.jp:00007300
2023-01-16T05:13:51Z
435:671:672
Record Extraction Based on User Feedback and Document Selection
ZHANG, Jianwei
ISHIKAWA, Yoshiharu
KITAGAWA, Hiroyuki
open access
The original publication is available at www.springerlink.com
In recent years, research on record extraction from large document collections has become popular. However, some problems remain in record extraction: 1) when a large document collection is the target of information extraction, the process usually becomes very expensive; 2) the extracted records may not match the topic the user is interested in. To address these problems, we propose a method that efficiently extracts the records whose topics agree with the user’s interest. To improve the efficiency of the information extraction system, our method identifies the documents from which useful records are likely to be extracted. We use user feedback on extraction results to find topic-related documents and records. Our experiments show that our system achieves high extraction accuracy across different extraction targets.
Proceedings of the Joint Conference of the 9th Asia-Pacific Web Conference and the 8th International Conference on Web-Age Information Management (APWeb/WAIM07)
Springer
2007-06
eng
journal article
AM
http://hdl.handle.net/2237/8987
https://nagoya.repo.nii.ac.jp/records/7300
0302-9743
Lecture Notes in Computer Science
574
585
https://nagoya.repo.nii.ac.jp/record/7300/files/2007-apweb-zhang.pdf
application/pdf
255.8 kB
2018-02-19