Automatic Query Expansion for Term Selection with BERT Score and WordNet Semantic Filtering

Authors

  • Rishabh Sachan  M.Tech. Scholar, Department of Computer Science and Engineering, Naraina College of Eng. and Tech., Kanpur, Uttar Pradesh, India
  • Mrs Sony Tripathi  Assistant Professor, Department of Computer Science and Engineering, Naraina College of Eng. and Tech., Kanpur, Uttar Pradesh, India

Keywords:

Information Retrieval, Query Extension, Pseudo Relevance Feedback, WordNet, Word2vec, Query Expansion, Theme Semantic Network

Abstract

In this modern era, with loads and tons of data, extracting a much relevant data has always been the user’s desire. This whole process of retrieving the relevant document is known as information retrieval. Information retrieval system uses query expansion techniques for retrieving the most relevant document for satisfying the user’s requirements. There have been several query expansions introduced few of which are knowledge based, corpus based, and pseudo relevance feedback. They make use of synonyms accordingly user’s query with a purpose of extracting the relevancy and it also screens the documents that contains the search term but has a completely different meaning. A combination of genetic algorithms and fuzzy logic blocks approach is proposed in this work. We also presented a new method that attenuates the power of knowledge based or corpus based techniques.

References

  1. G. W. Furnas, T. Landauer, L. Gomez, and S. Dumais, “The vocabulary problem in human-system communication,” Communications of the ACM, vol. 30, no. 11, pp. 964-971, 1987.
  2. H. Chen, J. Yu, K. Furuse, and N. Ohbo, “Support IR query refinementby partial keyword set,” Proceedings of the second international conference on web information systems engineering, Singapore, pp. 245–253, 2001.
  3. B. Kim, J. Kim, and J. Kim, “Query term expansion and reweighting using term co-occurrence similarity and fuzzy inference,” Proceedings of the joint ninth IFSA world congress and 20th NAFIPS international conference, Vancouver, Canada, pp. 715–720, 2001.
  4. Chang, S. Chen and C. Liau, “A new query expansion method based on fuzzy rules,” Proceedings of the seventh joint conference on AI, Fuzzy system, and Grey system, Taipei, Taiwan, Republic of China, 2003.
  5. B. Yates, and R. Neto, “Modern Information Retrieval,” Addison-Wesley Longman Publishing Co., Inc. Boston, MA, USA, 1999.
  6. Y. Bade, R. Bhat, and P. Borate, “Optimization techniques for improving the performance of information retrieval system,” International Journal of research on advanced technology, vol. 2, no. 2, pp. 263–267, 2014.
  7. K. C. Thompson, “Reducing the risk of query expansion via robust constrained optimization,” Proceeding of the 18th ACM conference on Information and knowledge management, NY, USA, pp. 837–846, 2009.
  8. K. Raman, R. Udupa, P. Bhattacharyya, and A. Bhole, “On improving pseudo-relevance feedback using pseudo-irrelevant documents,” Proceedings of ECIR, pp. 573–576, 2010.
  9. R. White, and G. Marchionini, “Examining the effectiveness of real-time query expansion,” Information Processing and Management, vol. 43, no. 3, pp. 685–704, 2007.
  10. Z. Ye, J. Huang, and H. Lin, “Finding a good query-related topic for boosting pseudo-relevance feedback,” Journal of the association of Information Science and Technology, vol. 62, no. 4, pp. 748–760, 2011.
  11. J. Singh, and A. Sharan, “Context window based co-occurrence approach for improving feedback based query expansion in information retrieval,” International Journal of Information Retrieval, vol. 5, no. 4, pp. 31–45, 2015.
  12. Y. Li, S. Chung, and J. Holt, “Text document clustering based on frequent word meaning sequences,” Data and Knowledge Engineering, vol. 64, no. 1, pp. 381–404, 2008.
  13. J. Aguera, and L. Araujo, “Comparing and combining methods for automatic query expansion,” Advances in natural language processing and applications, research in computing science, vol. 33, pp. 177–188, 2008.
  14. M. Valdivia, M. Galiano, A. Raez, and L. Lopez, “Using information gain to improve multi-modal information retrieval systems,” International Journal on Process Management, vol. 44, no. 3, pp. 1146–1158, 2008.
  15. C. Carpineto, and G. Romano, “A survey of automatic query expansion in information retrieval,” ACM Computer Survey, vol. 44, no. 1, pp. 1–50, 2012.
  16. A. Tomiye, A. Samuel, A. Ijesunor, and I. Udo, “A fuzzy-ontology based information retrieval system for relevance feedback,” International Journal of Computer Science, vol. 18, no. 1, pp. 382-389, 2011.
  17. J. Parapar, M. Quindimil, and A. Barreiro, “Score Distributions for Pseudo Relevance Feedback,” Information Sciences, vol. 273, pp.  171-181, 2014.
  18. S. Robertson, “On term selection for query expansion,” Journal of documentation, vol. 46, no. 4, pp. 359-364, 1990.
  19. J. Swets, “Information retrieval systems,” Journal of Science, vol. 141, pp. 245-250, 1963.
  20. C. Lee, “Fuzzy logic in control systems: Fuzzy logic controller, Parts I and II,” IEEE Transaction on System, Man and Cybernetics, vol. 20, pp. 404–435, 1990.
  21. Y. Gupta, and A. Saini, “A novel Fuzzy-PSO term weighting automatic query expansion approach using semantic filtering,” Knowledge Based System, vol. 136, pp. 97-120, 2017.
  22. Gupta, Yogesh, and Ashish Saini. "A novel term selection based automatic query expansion approach using PRF and semantic filtering." International Journal of Engineering and Advanced Technology 8.C (2019): 130-137.
  23. Singh, Saharan,” Rank fusion and semantic genetic notion based automatic query expansion model.” Swarm and Evolutionary Computation. Vol. 1(38),pp-295-308,Feb 2018.

Downloads

Published

2021-06-30

Issue

Section

Research Articles

How to Cite

[1]
Rishabh Sachan, Mrs Sony Tripathi, " Automatic Query Expansion for Term Selection with BERT Score and WordNet Semantic Filtering" International Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 8, Issue 4, pp.132-142, May-June-2021.