Automatic Query Expansion for Term Selection with BERT Score and WordNet Semantic Filtering
Keywords:
Information Retrieval, Query Extension, Pseudo Relevance Feedback, WordNet, Word2vec, Query Expansion, Theme Semantic NetworkAbstract
In this modern era, with loads and tons of data, extracting a much relevant data has always been the user’s desire. This whole process of retrieving the relevant document is known as information retrieval. Information retrieval system uses query expansion techniques for retrieving the most relevant document for satisfying the user’s requirements. There have been several query expansions introduced few of which are knowledge based, corpus based, and pseudo relevance feedback. They make use of synonyms accordingly user’s query with a purpose of extracting the relevancy and it also screens the documents that contains the search term but has a completely different meaning. A combination of genetic algorithms and fuzzy logic blocks approach is proposed in this work. We also presented a new method that attenuates the power of knowledge based or corpus based techniques.
References
- G. W. Furnas, T. Landauer, L. Gomez, and S. Dumais, “The vocabulary problem in human-system communication,” Communications of the ACM, vol. 30, no. 11, pp. 964-971, 1987.
- H. Chen, J. Yu, K. Furuse, and N. Ohbo, “Support IR query refinementby partial keyword set,” Proceedings of the second international conference on web information systems engineering, Singapore, pp. 245–253, 2001.
- B. Kim, J. Kim, and J. Kim, “Query term expansion and reweighting using term co-occurrence similarity and fuzzy inference,” Proceedings of the joint ninth IFSA world congress and 20th NAFIPS international conference, Vancouver, Canada, pp. 715–720, 2001.
- Chang, S. Chen and C. Liau, “A new query expansion method based on fuzzy rules,” Proceedings of the seventh joint conference on AI, Fuzzy system, and Grey system, Taipei, Taiwan, Republic of China, 2003.
- B. Yates, and R. Neto, “Modern Information Retrieval,” Addison-Wesley Longman Publishing Co., Inc. Boston, MA, USA, 1999.
- Y. Bade, R. Bhat, and P. Borate, “Optimization techniques for improving the performance of information retrieval system,” International Journal of research on advanced technology, vol. 2, no. 2, pp. 263–267, 2014.
- K. C. Thompson, “Reducing the risk of query expansion via robust constrained optimization,” Proceeding of the 18th ACM conference on Information and knowledge management, NY, USA, pp. 837–846, 2009.
- K. Raman, R. Udupa, P. Bhattacharyya, and A. Bhole, “On improving pseudo-relevance feedback using pseudo-irrelevant documents,” Proceedings of ECIR, pp. 573–576, 2010.
- R. White, and G. Marchionini, “Examining the effectiveness of real-time query expansion,” Information Processing and Management, vol. 43, no. 3, pp. 685–704, 2007.
- Z. Ye, J. Huang, and H. Lin, “Finding a good query-related topic for boosting pseudo-relevance feedback,” Journal of the association of Information Science and Technology, vol. 62, no. 4, pp. 748–760, 2011.
- J. Singh, and A. Sharan, “Context window based co-occurrence approach for improving feedback based query expansion in information retrieval,” International Journal of Information Retrieval, vol. 5, no. 4, pp. 31–45, 2015.
- Y. Li, S. Chung, and J. Holt, “Text document clustering based on frequent word meaning sequences,” Data and Knowledge Engineering, vol. 64, no. 1, pp. 381–404, 2008.
- J. Aguera, and L. Araujo, “Comparing and combining methods for automatic query expansion,” Advances in natural language processing and applications, research in computing science, vol. 33, pp. 177–188, 2008.
- M. Valdivia, M. Galiano, A. Raez, and L. Lopez, “Using information gain to improve multi-modal information retrieval systems,” International Journal on Process Management, vol. 44, no. 3, pp. 1146–1158, 2008.
- C. Carpineto, and G. Romano, “A survey of automatic query expansion in information retrieval,” ACM Computer Survey, vol. 44, no. 1, pp. 1–50, 2012.
- A. Tomiye, A. Samuel, A. Ijesunor, and I. Udo, “A fuzzy-ontology based information retrieval system for relevance feedback,” International Journal of Computer Science, vol. 18, no. 1, pp. 382-389, 2011.
- J. Parapar, M. Quindimil, and A. Barreiro, “Score Distributions for Pseudo Relevance Feedback,” Information Sciences, vol. 273, pp. 171-181, 2014.
- S. Robertson, “On term selection for query expansion,” Journal of documentation, vol. 46, no. 4, pp. 359-364, 1990.
- J. Swets, “Information retrieval systems,” Journal of Science, vol. 141, pp. 245-250, 1963.
- C. Lee, “Fuzzy logic in control systems: Fuzzy logic controller, Parts I and II,” IEEE Transaction on System, Man and Cybernetics, vol. 20, pp. 404–435, 1990.
- Y. Gupta, and A. Saini, “A novel Fuzzy-PSO term weighting automatic query expansion approach using semantic filtering,” Knowledge Based System, vol. 136, pp. 97-120, 2017.
- Gupta, Yogesh, and Ashish Saini. "A novel term selection based automatic query expansion approach using PRF and semantic filtering." International Journal of Engineering and Advanced Technology 8.C (2019): 130-137.
- Singh, Saharan,” Rank fusion and semantic genetic notion based automatic query expansion model.” Swarm and Evolutionary Computation. Vol. 1(38),pp-295-308,Feb 2018.
Downloads
Published
Issue
Section
License
Copyright (c) IJSRCSEIT

This work is licensed under a Creative Commons Attribution 4.0 International License.