Text Summarization for Oil and Gas News Article

L. H. Chong; Y. Y. Chen

Abstract:

The volume of information keeps growing, and companies are so overloaded that they can lose track of the information they actually need. Scanning through each lengthy document is time-consuming, so most information seekers prefer a shorter version that contains only the gist. In this paper, we therefore implement a text summarization system that produces a summary containing the gist of oil and gas news articles. The summaries are intended to give oil and gas companies the important information they need to monitor their competitors' behaviour and thereby help them formulate business strategies. The system uses a statistical approach that integrates three underlying concepts: keyword occurrences, the title of the news article, and the location of the sentence. The generated summaries were compared with human-generated summaries from an oil and gas company, and precision and recall ratios were used to evaluate the accuracy of the generated summaries. Based on the experimental results, the system is able to produce an effective summary, with an average recall of 83% at a compression rate of 25%.
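
The abstract names the three sentence features and the evaluation measures but gives no formulas. The Python sketch below shows one way such an extractive scorer and its precision/recall evaluation could look. It is an illustration under stated assumptions, not the authors' implementation: the stopword list, the equal feature weights, the normalizations, the naive sentence splitter, and the function names (summarize, precision_recall) are all hypothetical.

```python
import re
from collections import Counter

# Hypothetical stopword list; the abstract does not specify one.
STOPWORDS = {"the", "a", "an", "of", "in", "to", "and", "is", "was", "were",
             "for", "on", "at", "by", "with", "its", "it"}


def tokenize(text):
    """Lowercase alphanumeric tokens with stopwords removed."""
    return [w for w in re.findall(r"[a-z0-9]+", text.lower())
            if w not in STOPWORDS]


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p.strip() for p in parts if p.strip()]


def summarize(title, body, compression=0.25, w_kw=1.0, w_title=1.0, w_loc=1.0):
    """Return the top-scoring sentences in their original order.

    Each sentence score is a weighted sum of three features (weights are
    assumed, not taken from the paper):
      - keyword occurrences: mean document frequency of the sentence's words
      - title: fraction of title words that appear in the sentence
      - location: earlier sentences score higher
    """
    sentences = split_sentences(body)
    if not sentences:
        return []
    freq = Counter(tokenize(body))
    max_freq = max(freq.values()) if freq else 1
    title_words = set(tokenize(title))
    n = len(sentences)

    scored = []
    for i, sent in enumerate(sentences):
        words = tokenize(sent)
        kw = sum(freq[w] for w in words) / (len(words) * max_freq) if words else 0.0
        ttl = len(title_words & set(words)) / len(title_words) if title_words else 0.0
        loc = (n - i) / n
        scored.append((w_kw * kw + w_title * ttl + w_loc * loc, i))

    # Keep roughly `compression` of the sentences, and at least one.
    k = max(1, round(compression * n))
    top = sorted(scored, key=lambda t: (-t[0], t[1]))[:k]
    return [sentences[i] for _, i in sorted(top, key=lambda t: t[1])]


def precision_recall(system, reference):
    """Sentence-level precision and recall against a human-made summary."""
    sys_set, ref_set = set(system), set(reference)
    overlap = len(sys_set & ref_set)
    precision = overlap / len(sys_set) if sys_set else 0.0
    recall = overlap / len(ref_set) if ref_set else 0.0
    return precision, recall
```

With compression=0.25, a 20-sentence article yields a 5-sentence summary. Recall is then the share of the human-selected sentences that the system also selected, and precision is the share of system-selected sentences that the human also chose.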

Keywords: Information retrieval, text summarization, statistical approach.

Digital Object Identifier (DOI): doi.org/10.5281/zenodo.1060110
