Machine Learning Approach for Question Answering in Information Retrieval

Authors

  • G. N. Prithy  Department of Computer Science and Engineering, R. M. K Engineering College R. S. M. Nagar, Kavaraipettai, Tamil Nadu, India
  • S. Selvi  Department of Computer Science and Engineering, R. M. K Engineering College R. S. M. Nagar, Kavaraipettai, Tamil Nadu, India

Keywords:

CQA, Natural Language Processing, Mining.

Abstract

CQA (Community Question Answering) is a major challenge nowadays due to the popularity and advantages of CQA archives over the web. This paper deals with the methods to solve lexical gap problem in question retrieval and providing multiple domain based CQA archives. The aim of question answering in CQA is to find the existing questions that are similar to the question being asked but this has become a big challenge due to the lexical gap problem. In this paper, we have proposed to learn the word embedding and the category under which the question is being asked by using natural language processing. Three methods are being used. One is local mining and the other is global mining. In local mining the question s answer is checked in the local database, if found the answer is retrieved back else the global mining process starts in which the answer is checked in other sites information and the relevant answer is retrieved back. Third concept is the expert level where the answer for the query is got from an expert.

References

  1. Guangyou Zhou and Jimmy Xiangji Huang, Modelling and learning distributed word representation with metadata for question retrival , in proceedings of IEEE Transactions
  2. on Knowledge and Data Engineering ( Volume: 29, Issue: 6, June 1 2017 ).
  3. X. Xue, J. Jeon, and W. B. Croft, Retrieval models for question and answer archives, in Proceedings of the SIGIR, 2008, pp. 475 482.
  4. L. A. Adamic, J. Zhang, E. Bakshy, and M. S. Ackerman, Knowledge sharing and yahoo answers: Everyone knows something, in Proceedings of the WWW, 2008, pp. 665 674.
  5. H. Duan, Y. Cao, C.-Y. Lin, and Y. Yu, Searching questions by identifying question topic and question focus, in Proceedings of ACL, 2008, pp. 156 164.
  6. J.-T. Lee, S.-B. Kim, Y.-I. Song, and H.-C. Rim, Bridging lexical gaps between queries and questions on large online q&a collections with compact translation models, in Proceedings of the EMNLP, 2008, pp. 410 418.
  7. D. Bernhard and I. Gurevych, Combining lexical semantic resources with question & answer archives for translation-based answer finding, in Proceedings of the ACL, 2009, pp. 728 736.
  8. L. Cai, G. Zhou, K. Liu, and J. Zhao, Learning the latent topics for question retrieval in community QA, in Proc. 5th Int. Joint Conf. Natural Language Process., 2011, pp. 273 281..
  9. K. Zhang, W. Wu, H. Wu, Z. Li, and M. Zhou, Question retrieval with high quality answers in community question answering, in Proceedings of the CIKM, 2014, pp. 371 380.
  10. S. Robertson, S.Walker, S. Jones, M. Hancock-Beaulieu and M. Gatford, Okapi at TREC-3, in Proceedings of TREC, 1994, pp. 109 126.
  11. G. Zhou, Y. Liu, F. Liu, D. Zeng, and J. Zhao, Improving question retrieval in community question answering using world knowledge, in Proc. 23rd Int. Joint Conf. Artif. Intell., 2013, pp. 2239 2245.
  12. J. Jeon, W. B. Croft, and J. H. Lee, Finding similar questions in large question and answer archives, in Proceedings of the CIKM, 2005, pp.84 90 [12]X. Cao, G. Cong, B. Cui, C. S. Jensen, and Q. Yuan, "Approaches to exploring category information for question retrieval in community question- answer archives," ACM Trans. Inf. Syst., vol. 30, no. 2, 2012, Art. no. 7.
  13. Z. Ji, F. Xu, and B. Wang, "A category-integrated language model for question retrieval in community question answering," in Proc. 8th Asia Inf. Retrieval Societies Conf., 2012, pp. 14-25.
  14. A. L. Maas, R. E. Daly, P. T. Pham, D. Huang, A. Y. Ng, and C. Potts, "Learning word vectors for sentiment analysis," in Proc. 49th Annual Meeting Assoc. Computer. Linguistics, 2011, pp. 142-150.
  15. J. Turian, L. Ratinov, and Y. Bengio, "Word representations: A simple and general method for semi-supervised learning," in Proc. 48th Annual. Meeting Assoc. Computer. Linguistics, 2010, pp. 384-394.
  16. C. Xu, et al., "RC-NET: A general framework for incorporating knowledge into word representations," in Proc. 23rd ACM Int. Conf. Inf. Knowl. Manage., 2014, pp. 1219-1228.
  17. M. Yu and M. Dredze, "Improving lexical embeddings with semantic knowledge," in Proc. 52nd Annu. Meeting Assoc. Comput. Linguistics, 2014, pp. 545-550.
  18. X. Zheng, H. Chen, and T. Xu, "Deep learning for Chinese word segmentation and POS tagging," in Proc. Conf. Empirical Methods Natural Language Process., 2013, pp. 647-657.
  19. R. Collobert, J. Weston, L.Bottou, M.Karlen, K Kavukcuoglu,and P.P.Kuksa "Natural language processing (almost) from scratch," J. Mach. Learn. Res. , vol. 12, pp. 2493-2537, 2011.
  20. M. Bansal, K. Gimpel, and K. Livescu, "Tailoring continuous word representations for dependency parsing," in Proc. 52nd Annu. Meeting Assoc. Comput. Linguistics, 2014, pp. 809-815.
  21. R. Socher, J. Pennington, E. H. Huang, A. Y. Ng, and C. D. Manning, "Semi-supervised recursive auto encoders for predicting sentiment distributions," in Proc. Conf. Empirical Methods Natural Language Process., 2011, pp. 151-161.

Downloads

Published

2018-02-28

Issue

Section

Research Articles

How to Cite

[1]
G. N. Prithy, S. Selvi, " Machine Learning Approach for Question Answering in Information Retrieval, IInternational Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 3, Issue 1, pp.1549-1556, January-February-2018.