Document Analysis using Similarity Measures : A Case Study on Text Retrieval System

Suresha M

doi:10.32628/CSEIT172638

Authors

Suresha M Department Of Computer Science, Kuvempu University, India

Keywords:

Document Analysis, Similarity Measures, Text Retrieval.

Abstract

A document is an information container that contains information either in printed format or in handwritten format and document is a medium for transferring knowledge. Human vision is the most accurate language identification system in the world. Within a few seconds of looking at a document, one can determine the language even without deskewing and segmenting the image, while computer vision is not able to match human capability. Today there is an increasing need for automatic language identification with the support of computers. As the world moves from paper to paperless office, more and more communication and storage of documents is performed digitally which facilitates quicker additions, searches and modifications and increases the life of such records.

References

B B Chaudhuri and U Pal, Skew Angle Detection of Digitized Indian Script Documents, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol.19, No.2, 1997.
Cattoni R., Coianiz T., Messelodi S., and Modena M C., 1998. Geometric Layout Analysis Techniques for Document Image Understanding: A Review, ITC - IRST, 1998.
Rangachar Kasturi, Lawrence o Gorman and Venu Govindaraju, Document image analysis: a primer, Sadhana, Vol 22, Part I, pp 3-22, 2002.
Song Mao, Azriel Rosen Feld, and Tapas Kanungo, Document structure analysis algorithms: A literature survey, Electronic Imaging, 2003.
Yu B., and Jain A K., A generic system for form dropout. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 18, No.11, 1996.
Yuan Y Tang, Seong Whan Lee, and Ching Y Suen, Automatic Document Processing: A Survey, Vol 29, No.12, pp 1931-1952, 1996.

Document Analysis using Similarity Measures : A Case Study on Text Retrieval System

Authors

Keywords:

Abstract

References

Downloads

Published

Issue

Section

License

How to Cite