A Relationship Extraction Method from Literary Fiction Considering Korean Linguistic Features

Hee-Jeong Ahn; Kee-Won Kim; Seung-Hoon Kim

Commenced in January 2007

Frequency: Monthly

Edition: International

Paper Count: 32804

A Relationship Extraction Method from Literary Fiction Considering Korean Linguistic Features

Authors: Hee-Jeong Ahn, Kee-Won Kim, Seung-Hoon Kim

Abstract:

The knowledge of the relationship between characters can help readers to understand the overall story or plot of the literary fiction. In this paper, we present a method for extracting the specific relationship between characters from a Korean literary fiction. Generally, methods for extracting relationships between characters in text are statistical or computational methods based on the sentence distance between characters without considering Korean linguistic features. Furthermore, it is difficult to extract the relationship with direction from text, such as one-sided love, because they consider only the weight of relationship, without considering the direction of the relationship. Therefore, in order to identify specific relationships between characters, we propose a statistical method considering linguistic features, such as syntactic patterns and speech verbs in Korean. The result of our method is represented by a weighted directed graph of the relationship between the characters. Furthermore, we expect that proposed method could be applied to the relationship analysis between characters of other content like movie or TV drama.

Keywords: Data mining, Korean linguistic feature, literary fiction, relationship extraction.

Digital Object Identifier (DOI): doi.org/10.5281/zenodo.1112312

Procedia APA BibTeX Chicago EndNote Harvard JSON MLA RIS XML ISO 690 PDF Downloads 1752

References:

[1] D. K. Elson, D. Nicholas and K. R. McKeown, “Extracting social networks from literary fiction”, Proceedings of the 48th annual meeting of the association for computational linguistics, pp. 138-147, Jul 2010
[2] H. He, D. Barbosa and G. Kondrak, “Identification of Speakers in Novels”, ACL (1), pp.1312-1320, Aug 2013
[3] G. M. Park, S. H. Kim and H. G. Cho, “Complex system analysis of social networks extracted from literary fictions” International Journal of Machine Learning and Computing, vol.3, pp. 107-111, Feb 2013.
[4] C. Y. Jung, Y. H. Seo, “Machine Translation of Korean-to-English spoken language Based on Semantic Patterns”, Korea Information Processing Society, vol. 5, issue 9, pp. 2361-2368, 1998.
[5] H. G. Lee, M.S. Choi, H. S. Kim, “One-Class Classification Model Based on Lexical Information and Syntactic Patterns”, Jour nal of KIISE, vol. 42, pp. 817-822, 2015.
[6] H. G. Yoon, S. B. Park, "Pattern and Instance Generation for Self-knowledge Learning in Korean" Journal of Korean Institute of Intelligent Systems, vol.25, pp. 63-69, Jan 2015.
[7] Hannanum, http://semanticweb.kaist.ac.kr/home/index.php/HanNanum, 2014.
[8] S. H. Kim, T. K. Park, S. H. Kim, “A Recognition Method for Main Characters Name in Korean Novels”, The Journal of Korea Institute of Information, Electronics, and Communication Technology, vol. 9, no. 1 pp. 75-98, Feb 2016.