2024-03-28T08:36:18Z
https://nagoya.repo.nii.ac.jp/oai
oai:nagoya.repo.nii.ac.jp:00007300
2023-01-16T05:13:51Z
435:671:672
Record Extraction Based on User Feedback and Document Selection
ZHANG, Jianwei
20090
ISHIKAWA, Yoshiharu
20091
KITAGAWA, Hiroyuki
20092
In recent years, the research of record extraction from large document data is becoming popular. However there still exist some problems in record extraction. 1) when large document data is used for the target of information extraction, the process usually becomes very expensive. 2) it is also likely that extracted records may not pertain to the user’s interest on the aspect of the topic. To address these problems, in this paper we propose a method to efficiently extract those records whose topics agree with the user’s interest. To improve the efficiency of the information extraction system, our method identifies documents from which useful records are probably extracted. We make use of user feedback on extraction results to find topic-related documents and records. Our experiments show that our system achieves high extraction accuracy across different extraction targets.
Proceedings of the Joint Conference of the 9th Asia-Pacific Web Conference and the 8th International Conference on Web-Age Information Management (APWeb/WAIM07)
journal article
Springer
2007-06
application/pdf
Lecture Notes in Computer Science
574
585
http://hdl.handle.net/2237/8987
0302-9743
https://nagoya.repo.nii.ac.jp/record/7300/files/2007-apweb-zhang.pdf
eng
The original publication is available at www.springerlink.com