A Review on Automatic Person Attribute Information Extraction and Disambiguation from Unstructured text

Authors

  • Yadnesh Charekar  Computer Department, Marathwada Mitra Mandal's College of Engineering, Savitribai Phule Pune University, Maharashtra, India
  • Ruchita Abhang  Computer Department, Marathwada Mitra Mandal's College of Engineering, Savitribai Phule Pune University, Maharashtra, India
  • Rutvij Joshi  Computer Department, Marathwada Mitra Mandal's College of Engineering, Savitribai Phule Pune University, Maharashtra, India
  • Shreyas Kulkarni  Computer Department, Marathwada Mitra Mandal's College of Engineering, Savitribai Phule Pune University, Maharashtra, India
  • Ila Savant  Assistant Professor, Computer Department, Marathwada Mitra Mandal's College of Engineering, Savitribai Phule Pune University, Maharashtra, India

DOI:

https://doi.org//10.32628/CSEIT2390216

Keywords:

Natural Language Processing (NLP), Named Entity Recognition (NER), Entity Attribute Extraction, Conditional Random Fields (CRF), Deep Learning, SpaCy, LUKE

Abstract

Entity attribute extraction is the process of identifying and extracting attributes, or characteristics, of entities from a given text. The objective is to create a model that can automatically perform person-attribute information extraction from unstructured text. Entity attribute extraction's primary goal is to locate and extract attributes of entities from a supplied text. As a result, information from the unstructured text may now be represented in a structured way. By extracting attributes of entities, a computer program can gain a better understanding of the information contained in the text and can use this information for various purposes such as building a knowledge base or for information retrieval. In this way, entity attribute extraction can help to improve the ability of computer programs to process and understand natural language text. All the essential tools and algorithms are researched and discussed in this paper. This study is divided into two main sections that explore published works and modern tools and technologies working in the field of Entity attribute extraction. It also identifies critical research gaps in the literature under assessment. The gap analysis reveals potential for improved textual event prediction algorithms in the future.

References

  1. Y. Ding, News Article Name Disambiguation Model Based on Reinforcement Learning, 2021 International Conference on Artificial Intelligence, Big Data and Algorithms (CAIBDA), Xi'an, China, 2021, pp. 122-127, DOI:10.1109/CAIBDA53561.2021.00033.
  2. Nguyen, G., Dlugolinský, Š., Laclavík, M., Šeleng, M., Tran, V. (2014). Next Improvement Towards Linear Named Entity Recognition Using Character Gazetteers. In: van Do, T., Thi, H., Nguyen, N. (eds) Advanced Computational Methods for Knowledge Engineering. Advances in Intelligent Systems and Computing, vol 282.,Springer,Cham.DOI:10.1007/978-3-319-06569-4_19
  3. Q. Wang and M. Iwaihara, Deep Neural Architectures for Joint Named Entity Recognition and Disambiguation, 2019 IEEE International Conference on Big Data and Smart Computing (BigComp), Kyoto, Japan,2019,pp.1-4,DOI:10.1109/BIGCOMP.2019.8679233.
  4. K. Zhang, Y. Zhu, W. Gao, Y. Xing and J. Zhou, An Approach for Named Entity Disambiguation with Knowledge Graph, 2018 International Conference on Audio, Language and Image Processing (ICALIP), Shanghai, China, 2018, pp. 138-143, DOI:10.1109/ICALIP.2018.8455418.
  5. L. Ma and W. Liu, An Enhanced Method for Entity Trigger Named Entity Recognition Based on POS Tag Embedding, 2021 IEEE 7th International Conference on Cloud Computing and Intelligent Systems (CCIS), Xi'an, China, 2021, pp. 89-93, DOI:10.1109/CCIS53392.2021.9754614.
  6. H. M. S. J. H. D. P. A. H, Review Paper on Named Entity Recognition and Attribute Extraction using Machine Learning, IJRITCC, vol. 4, no. 11, pp. 41 –, Nov.2016. DOI:10.17762/ijritcc.v4i11.2600
  7. N. Kanya and T. Ravi, Modelings and techniques in Named Entity Recognition-an Information Extraction task, IET Chennai 3rd International on Sustainable Energy and Intelligent Systems (SEISCON 2012), Tiruchengode, 2012, pp. 1-5, DOI:10.1049/cp.2012.2199.
  8. Yaojie Lu, Hongyu Lin, Jialong Tang, Xianpei Han, Le Sun, End-to-End Neural Event Coreference Resolution, Artificial Intelligence. 2022 Feb 1;303:103632. DOI:10.48550/arXiv.2009.08153
  9. Thakare, Abhijeet & Deshpande, P.s. (2018). Automatic Extraction of Attributes and Entities for Product Differentiation. International Journal of Computational Intelligence Systems. 11. 296. DOI:10.2991/ijcis.11.1.23
  10. Zhong, Bei, Jin Liu, Yuanda Du, Yunlu Liaozheng and Jiachen Pu. Extracting Attributes of Named Entity from Unstructured Text with Deep Belief Network. International journal of database theory and application 9 (2016): 187-196.DOI:10.14257/IJDTA.2016.9.5.19
  11. Xue Shi, Yingping Yi, Ying Xiong, Buzhou Tang, Qingcai Chen, Xiaolong Wang, Zongcheng Ji, Yaoyun Zhang, Hua Xu, Extracting entities with attributes in clinical text via joint deep learning, Journal of the American Medical Informatics Association, Volume 26, Issue 12, December 2019, Pages 1584–1591, DOI:10.1093/jamia/ocz158
  12. Liu, S., Chen, Y., Xie, X., Siow, J., & Liu, Y. (2020). Retrieval-Augmented Generation for Code Summarization via Hybrid GNN. International Conference on Learning Representations. DOI:10.48550/arXiv.2006.05405

Downloads

Published

2023-04-30

Issue

Section

Research Articles

How to Cite

[1]
Yadnesh Charekar, Ruchita Abhang, Rutvij Joshi, Shreyas Kulkarni, Ila Savant, " A Review on Automatic Person Attribute Information Extraction and Disambiguation from Unstructured text, IInternational Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 9, Issue 2, pp.195-201, March-April-2023. Available at doi : https://doi.org/10.32628/CSEIT2390216