Human Digital Twin for Personal Conversation Automation Using Supervised Machine Learning Approaches
Authors: Aya Salama
Abstract:
Digital Twin has emerged as a compelling research area, capturing the attention of scholars over the past decade. It finds applications across diverse fields, including smart manufacturing and healthcare, offering significant time and cost savings. Notably, it often intersects with other cutting-edge technologies such as Data Mining, Artificial Intelligence, and Machine Learning. However, the concept of a Human Digital Twin (HDT) is still in its infancy and requires further demonstration of its practicality. HDT takes the notion of Digital Twin a step further by extending it to living entities, notably humans, who are vastly different from inanimate physical objects. The primary objective of this research was to create an HDT capable of automating real-time human responses by simulating human behavior. To achieve this, the study delved into various areas, including clustering, supervised classification, topic extraction, and sentiment analysis. The paper successfully demonstrated the feasibility of HDT for generating personalized responses in social messaging applications. Notably, the proposed approach achieved an overall accuracy of 63%, a highly promising result that could pave the way for further exploration of the HDT concept. The methodology employed Random Forest for clustering the question database and matching new questions, while K-nearest neighbor was utilized for sentiment analysis.
Keywords: Human Digital twin, sentiment analysis, topic extraction, supervised machine learning, unsupervised machine learning, classification and clustering.
Procedia APA BibTeX Chicago EndNote Harvard JSON MLA RIS XML ISO 690 PDF Downloads 173References:
[1] R. Rosen, G. Wichert, G. Lo, R. Rosen, et al. (2015), About the importance of autonomy and digital twins for the future of manufacturing, IFAC-Papers OnLine 48-3 567–572.
[2] hengli W.(2021), Is Human Digital Twin Possible? Computer Methods and Programs in Biomedicine Update, p. 100014
[3] V. Damjanovic-Behrendt, W. Behrendt, (2019), An open source approach to the design and implementation of digital twins for smart manufacturing, Int. J.Comput. Integr. Manuf., doi:10.1080/0951192X.2019.1599436.
[4] C. Mandolla, A. Messeni, Petruzzelli, al. (2019), Building a digital twin for additive manufacturing through the exploitation of blockchain: a case analysis of the aircraft industry, Comput. Ind. 109, 134–152.
[5] D. Guivarch, E. Mermoz, Y. Marino, et al., (2019), Creation of helicopter dynamic systems digital twin using multibody simulations, doi:10.1016/j.cirp.2019.04.041.
[6] G. Sannino, G. De Pietro,(2019), L. Verde, Healthcare systems: an overview of the most important aspects of current and future m-health applications, doi:10.1007/978-3-030-27844.
[7] S.U. Amin, M. Alsulaiman, G. Muhammad, (2019), Deep learning for EEG motor imagerybased cognitive healthcare, doi:10.1007/978-3-030-27844-1.
[8] A.E. Saddik, H. Badawi, R.A.M. Velazquez, et al., (2019), Dtwins: a digital twins ecosystem for health and well-being, IEEE COMSOC MMTC Commun.–Front. 14 (2).
[9] N.K. Chakshu, J. Carson, I. Sazonov, et al., (2019), A semi-active human digital twin model for detecting severity of carotid stenoses from head vibration - acoupled computational mechanics and computer vision method, doi:10.1002/cnm.3180.
[10] (online) Available: broadbandsearch.net/blog/most-popular-socialnetworking- sites.
[11] (online) Available: https://blog.hootsuite.com/whatsapp-stats/
[12] E. Adamopoulou, L. Moussiades, (2020), Chatbots: History, technology, and applications Machine Learning with Applications, 2, p. 100006,
[13] W. Medhat, A. Hassan and H. Korashy, (2014), ”Sentiment analysis algorithms and applications: A survey”, Ain Shams engineering journal, vol. 5, no. 4, pp. 1093-1113.
[14] Akhtar, M. S., Kumar, A., Ghosal, D., Ekbal, A., Bhattacharyya, P. (2017). A multilayer perceptron based ensemble technique for fine-grained financial sentiment analysis. In Proceedings of the Conference on Empirical Methods on Natural Language Processing.
[15] dos Santos, C. N., Gatti, M. (2014). Deep convolutional neural networks for sentiment analysis for short texts. In Proceedings of the International Conference on Computational Linguistics.
[16] J. serrano, F. Romero and J. olives, (2021), Ordered Weighted Averaging for Emotion-Driven Polarity Detection, Serrano-Guerrero J, Romero FP, Olivas JA. Ordered weighted averaging for emotion-driven polarity detection. Cognit Comput..
[17] P. Turney., (2000), Learning Algorithms for Keyphrase Extraction. Information Retrieval, 2(4):303–336.
[18] Tomokiyo, Takashi; Hurst, Matthew, (2003), A language model approach to keyphrase extraction, In Proceedings of the ACL 2003 workshop on Multiword expressions: analysis, acquisition and treatment, Vol. 18. Association for Computational Linguistics, Stroudsburg, PA, USA, 33-40.
[19] Jain, S., Pareek, J., (2009), KeyPhrase Extraction Tool (KET) for Semantic Metadata Annotation of Learning Materials,” 2009 International Conference on Signal Processing Systems, vol., no., pp.625-628, 15-17.
[20] Wartena, C., Brussee, R., (2008), Topic Detection by Clustering Keywords,” Database and Expert Systems Application, 19th International Workshop, pp.54-58, 1-5,
[21] (online) Available: Edward Loper and Steven Bird. Nltk: The natural language toolkit, 2002.
[22] Ranjan B, Schmidt F, Sun W, Park J, Honardoost MA, Tan J, et al., (2021), combining supervised and unsupervised clustering for cell type identification in single-cell RNA sequencing data. BMC Bioinform.; 22(1):186.
[23] D. A. Adeniyi, Z. Wei and Y. Yongquan, (2016), Automated Web usage data mining and recommendation system using K-nearest neighbor (KNN) classification method, Appl. Comput. Inform., vol. 12, no. 1, pp. 90-108.
[24] J. Dhaliwal, L. Erdman, E. Drysdale, F. Rinawi, J. Muir, T. D. Walters, I. Siddiqui, A. M. Griffiths, and P. C. Church, (2021), Accurate Classification of Pediatric Colonic Inflammatory Bowel Disease Subtype Using a Random Forest Machine Learning Classifier, Journal of Pediatric Gastroenterology Nutrition, vol. 72, no. 2, pp. 262–269.
[25] Goadrich, M., Oliphant, L., Shavlik, J. (2004). Learning ensembles of first-order clauses for recall-precision curves: A case study in biomedical information extraction. Proceedings of the 14th International Conference on Inductive Logic Programming (ILP). Porto, Portugal.
[26] Raghavan, V., Bollmann, P., Jung, G. S. (1989). A critical investigation of recall and precision as measures of retrieval system performance. ACM Trans. Inf. Syst., 7, 205–229.
[27] Markus, B. H., Lancianese, S. L., Nagarajan, M. B., Ikpot, I. Z., Lerner, A. L., Wism, A. (2011) Prediction of Biomechanical Properties of Trabecular Bone in MR Images With Geometric Features and Support Vector Regression, IEEE Transactions on Biomedical Engineering, 58, 6, 1820-1826.