Document Retrieval Techniques using Vector Space Model

Authors

  • Tanmayee Tushar Parbat  B.E IT, Dr. Vishwanath Karad MIT World Peace University, Pune, Maharashtra, India
  • Rohan Benhal  BBA IT, Dr. Vishwanath Karad MIT World Peace University, Pune, Maharashtra, India
  • Honey Jain  B.E IT, Dr. Vishwanath Karad MIT World Peace University, Pune, Maharashtra, India

Keywords:

Document Retrieval (IR), Indexing, IR mode, Searching, Vector Space Model (VSM)

Abstract

For thousands of years people have realized the importance of archiving and finding information. With the advent of computers, it became possible to store large amounts of information; and finding useful information from such collections became a necessity. The field of Document Retrieval (DR) was born in the 1950s out of this necessity. Over the last forty years, the field has matured considerably. Several DR systems are used on an everyday basis by a wide variety of users. Information retrieval is become a important research area in the field of computer science. Information retrieval (IR) is generally concerned with the searching and retrieving of knowledge-based information from database. In this paper, we represent the various models and techniques for information retrieval. In this Review paper we are describing different indexing methods for reducing search space and different searching techniques for retrieving a information. We are also providing the overview of traditional IR models.

References

  1. M.François Sy, S.Ranwez, J.Montmain,“User centered and ontology based information Retrieval system for life sciences”, BMC Bioinformatics,2105.
  2. R. Sagayam, S.Srinivasan, S. Roshni, “A Survey of Text Mining: Retrieval, Extraction and Indexing Techniques”, IJCER, sep 2012, Vol. 2 Issue. 5, , PP: 1443-1444,.
  3. Anwar A. Alhenshiri, “Web Information Retrieval and Search Engines Techniques”,2010,Al- Satil journal,PP: 55-92.
  4. D.Hiemstra,P. de Vries, “Relating the new language models of information retrieval to the traditional retrieval models”, published as CTIT technical report TR-CTIT-00-09, May 2000.
  5. Djoerd Hiemstra, “Information Retrieval Models”, published in Goker, A., and Davies, J. Information Retrieval: Searching in the 21st Century. John Wiley and Sons, November 2009,Ltd., ISBN-13: 978-0470027622.
  6. Christos Faloutsos, Douglas W. Oard, “A Survey of Information Retrieval and Filtering Methods”, CS-TR-3514, Aug 1995. “Algorithms for Information Retrieval – Introduction”, Lab module 1.
  7. R. Baeza-Yates and B. Ribeiro-Neto, "Modern Information Retrieval",2009, ACM Press, ISBN: 0-201-39829-X.
  8. S.E. Robertson and K. Sparck Jones. “Relevance weighting of search terms. Journal of the American Society for Information Science”, 1976, 27:129–146.
  9. G. Salton and M.J. McGill, “editors. Introduction to Modern Information Retrieval”. McGraw-Hill ,1983.
  10. H. Turtle, “Inference Networks for Document Retrieval”. Ph.D. thesis, Department of Computer Science,University of Massachusetts, Amherst, MA 01003. Available as COINS Technical Report 90-92, 1990.
  11. C. J. van Rijsbergen. “Information Retrieval. Butterworths”, London,1979.
  12. T. Strzalkowski, L. Guthrie, J. Karlgren, J. and et. “Natural language information retrieval: TREC-5 report”. In Proceedings of the Fifth Text REtrieval Conference (TREC-5), 1997.
  13. Gerard Salton and M. J. McGill. “Introduction to Modern Information Retrieval”. McGraw Hill Book Co.,New York, 1983.
  14. Gerard Salton and Chris Buckley. “Termweighting approaches in automatic text retrieval”. Information Processing and Management, , 1988, 24(5):513–523.
  15. Gerard Salton, editor. “The SMART Retrieval System—Experiments in Automatic Document Retrieval”.Prentice Hall Inc., Englewood Cliffs, NJ, 1971.
  16. N. J. Belkin and W. B. Croft.” Information filtering and information retrieval: Two sides of the same coin? “,Communications of the ACM, 1992,35(12):29–38.

Downloads

Published

2021-10-30

Issue

Section

Research Articles

How to Cite

[1]
Tanmayee Tushar Parbat, Rohan Benhal, Honey Jain, " Document Retrieval Techniques using Vector Space Model " International Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 8, Issue 5, pp.93-99, September-October-2021.