Context-Based Semantic Similarity and Document Retrieval

Authors

  • Tanmay Joshi  Department of Computer Engineering, Pune Institute of Computer Technology, Pune, Maharashtra, India
  • Prof. A. G. Phakatkar  Department of Computer Engineering, Pune Institute of Computer Technology, Pune, Maharashtra, India

DOI:

https://doi.org//10.32628/CSEIT183896

Keywords:

Semantic Web, Ontology/ Taxonomy, Natural Language Processing

Abstract

Text based methods are extensively used in information retrieval on web. There are many different ways in which a statement can be expressed by using various words conveying the same meaning. Also, a single word can mean a lot of different things in various contexts. It is a challenging task to make the system understand what exactly the word or statement means and in which context. Hence, it is an important task to find out the context of the words so as to effectively understand what the user intends to mean which in turn can be used in a variety of NLP and text processing applications. A method is proposed in order to find out the similarity between two words and use the results in order to retrieve documents.

References

  1. P. Li, H. Wang, K. Q. Zhu, Z. Wang, X. Hu, and X. Wu, “Large Probabilistic Semantic Network Based Approach to Compute Term Similarity” in IEEE Transactions on Knowledge and Data Engineering, Vol. 27, No. 10, October 2015
  2. Kavitha Sri.M and Hemalatha.P. “Survey on Text Classification Based on Similarity” in International Journal of Innovative Research in Computer and Communication Engineering, Vol. 3, Issue 3, March 2015
  3. S.S. Kulkarni and K.S.Kadam, “Development of Term Similarity Measurement Using Semantic Network Approach” in International Journal of Advance Engineering and Research Development, Volume 3, Issue 11, November 2016.
  4. P. Ransik. “Using Information Content to Evaluate Semantic Similarity in a Taxonomy”,in Proc. 14th Int. Joint Conf. Artif. Intell.,1995, pp. 448-453.
  5. R. Rada, H. Mili, E. Bicknell, and M. Blettner. “Development and Application of a Metric on Semantic Nets” in IEEE Transactions On Systems. Man. And Cybfrnetics, Vol 19. January 1989.
  6. Q. Do, D. Roth, M. Sammons, Y. Tu and V.G.V. Vydiswaran. “Robust, Light-weight Approaches to compute Lexical Similarity” University of Illinois, Champaign, IL, USA, Comput. Sci. Res. Tech. Rep., 2009
  7. D. Sánchez, M. Batet and D. Isern, “Ontology-based information content computation” Knowledge-Based Systems vol 24, 2011 pp. 297-303.
  8. E. Banerjee and T. Pedersen. “An adapted Lesk Algorithm for Word Sense Disambiguation Using WordNet”, in Proc. 3rd Int. Conf. Comput. Linguistics Intell. Text Process, 2002, pp. 136 – 145.
  9. D. Bollegala, Y. Matsuo, and M. Ishizuka. “A Web Search Engine-Based Approach to Measure Semantic Similarity between Words”, in IEEE Transactions On Knowledge And Data Engineering, Vol. 23, No. 7, July 2011.
  10. D. L. T. Rohde, L. M. Gonnerman, and D. C. Plaut, “An improved model of semantic similarity based on lexical co-occurrence,” in Commun. ACM, vol. 8, pp. 627 - 633, 2005.
  11. J. Kazama, S. D. Saeger, K. Kuroda, M. Murata, and K. Torisawa, “A Bayesian method for robust estimation of distributional similarities” in Proc. 48th Annu. Meeting Assoc. Comput. Linguistics, 2010, pp. 247 - 256.

Downloads

Published

2018-12-30

Issue

Section

Research Articles

How to Cite

[1]
Tanmay Joshi, Prof. A. G. Phakatkar, " Context-Based Semantic Similarity and Document Retrieval , IInternational Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 3, Issue 8, pp.398-402, November-December-2018. Available at doi : https://doi.org/10.32628/CSEIT183896