Automatic Enhanced Update Summary Generation System for News Documents
Authors: S. V. Kogilavani, C. S. Kanimozhiselvi, S. Malliga
Abstract:
Fast changing knowledge systems on the Internet can be accessed more efficiently with the help of automatic document summarization and updating techniques. The aim of multi-document update summary generation is to construct a summary unfolding the mainstream of data from a collection of documents based on the hypothesis that the user has already read a set of previous documents. In order to provide a lot of semantic information from the documents, deeper linguistic or semantic analysis of the source documents were used instead of relying only on document word frequencies to select important concepts. In order to produce a responsive summary, meaning oriented structural analysis is needed. To address this issue, the proposed system presents a document summarization approach based on sentence annotation with aspects, prepositions and named entities. Semantic element extraction strategy is used to select important concepts from documents which are used to generate enhanced semantic summary.
Keywords: Aspects, named entities, prepositions, update summary.
Digital Object Identifier (DOI): doi.org/10.5281/zenodo.1099308
Procedia APA BibTeX Chicago EndNote Harvard JSON MLA RIS XML ISO 690 PDF Downloads 2135References:
[1] Dragomir Radev, R., Hongyan Jing, Malgorzata Stys and Daniel Tam, “Centroid-based summarization of multiple documents”, International Journal of Information Processing and Management, Vol. 40, pp. 919-938, 2004.
[2] Kirill Kireyev, “Using latent semantic analysis for extractive summarization”, In Proceedings of Text Analysis Conference, 2008.
[3] Josef Steinberger and Karel Jezek, “Update Summarization Based on Novel Topic Distribution”, In Proceedings of the 9th ACM Symposium on Document Engineering, pp. 205-213, 2009.
[4] Chong Long, Min-Lie Huang and Xiao-Yan Zhu, “A New Approach for Multi-Document Update Summarization”, Journal of Computer Science and Technology, Vol. 25, No. 4, pp. 739-749, 2010.
[5] Lei Huang and Yanxiang He, “CorrRank: Update Summarization Based on Topic Correlation Analysis”, Lecture Notes in Computer Science, Vol. 6216, pp. 641-648, 2010.
[6] Min Peng, Xiaoxiao Ma, Ye Tian, Hua Long, Quanchen Lin and Xiaojun Xia, “The Web Information Extraction for Update Summarization Based on Shallow Parsing”, In Proceedings of International Conference on P2P, Parallel, Grid, Cloud and Internet Computing, pp. 109-114, 2011.
[7] Pan Du, Jipeng Yuan, Xianghui Lin, Jin Zhang, Jiafeng Guo and Xueqi Cheng, “Decayed DivRank for Guided Summarization”, In Proceedings of Text Analysis Conference, 2011.
[8] Chandra, M., Gupta, V. and Paul, S.K. “A Statistical Approach for Automatic Text Summarization by Extraction”, In Proceedings of International Conference on Communication Systems and Network Technologies, pp. 268-271, 2011.
[9] Pierre-Etienne Genest, Guy Lapalme, Luka Nerima and Eric Wehrli, “A symbolic Summarizer for the Update Task of TAC 2008”, In Proceedings of Text Analysis Conference, 2008.
[10] Jenny Rose Finkel, Trond Grenager and Christopher Manning, “Incorporating Non-local Information into Information Extraction Systems by Gibbs Sampling”, In Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics, pp. 363-370, 2005.
[11] Vivi Nastase, David Milne and Katja Filippova, “Summarizing with Encyclopedic Knowledge”, In Proceedings of Text Analysis Conference, 2009.
[12] Yajie, M. and Li, C. “WikiSummarizer - A Wikipedia-based summarization system”, In Proceedings of the Text Analaysis Conference, 2010.
[13] Christopher Manning, D., Prabhakar Raghavan and Hinrich Schutze, “Introduction to Information Retrieval”, Cambridge University Press, Cambridge, 2008.
[14] Chin-Yew Lin, “ROUGE: A Package for Automatic Evaluation of Summaries”, In Proceedings of the ACL-04 Workshop, pp. 74-81, 2004.