Extractive Summarizer Construction Techniques : A Survey

Vaishali V. Sarwadnya; Sheetal. S. Sonawane

doi:10.32628/CSEIT183152

Authors

Vaishali V. Sarwadnya Pune Institute of Computer Technology, Pune, Maharashtra, India
Sheetal. S. Sonawane Pune Institute of Computer Technology, Pune, Maharashtra, India

Keywords:

Extractive Summarizer, Feature Extraction, Sentence Scoring, Marathi

Abstract

Manual summarization of large documents of texts is tedious and error prone. Also, the results in such kind of summarization may lead to different results for a particular document. Thus, Automatic text summarization has become important due to the tremendous growth of information and data. It chooses the most informative part of text and forms summaries that reveal the main purpose of the given document. It yields summary produced by summarization system which allows readers to comprehend the content of document instead for reading each and every individual document. So, the overall intention of Text Summarizer is to provide the meaning of text in less words and sentences. Summarization can be categorized as: Abstractive summarization and Extractive summarization. This case study is based on an extractive concept implemented on the studied models. Numerous automatic text summarization systems are handy today for English and other foreign languages. But when it comes to Indian languages, we observe inadequate number of automatic summarizers. Evaluation can be done using quantitative or qualitative approach. This paper describes review of techniques used while constructing extractive summarizers and an approach to construct extractive summarizer for Marathi.

References

Virat V. Giri, Dr.M.M. Math and Dr. U. P. Kulkarni, A Survey of Automatic Text Summarization System for Different Regional Languages in India Bonfring International Journal of Software Engineering and Soft Computing, Vol. 6, Special Issue, October 2016
Sheetal Shimpikar and Sharvari Govilkar, A Survey of Text Summarization Techniques for Different Regional Languages in India, International Journal of Computer Applications, Vol. 165, No. 11, May 2017
Sunitha C, Dr. A Jaya and Amal Ganesh, A Survey of Abstractive Summarization Techniques in Indian Languages, 2016
Hamzah Noori Fejer and Nazlia Omar, Automatic Multi-Document Arabic Text Summarization Using Clustering and Keyphrase Extraction ICIMU IEEE 2014 International Conference,978-1-4799-5423-0.
Deepali K. Gaikwad, Deepali Sawane and C. Namrata Mahender, Rule Based Question Generation for Marathi Text Summarization using Rule Based StemmerIOSR Journal of Computer Engineering (IOSR-JCE), eISSN: 2278-0661. 2015 6Mudassar Majgaonkar and Tanveer Siddiqui, Discovering sufﬁxes: A case study for Marathi Language (IJCSE) International Journal on Computer Science and Engineering Vol. 02, No. 08, 2010, 2716-2720
Aishwarya Sahani, Kaustubh Sarang, Sushmita Umredkar, and Mihir Patil, Automatic Text Categorization of Marathi Language Documents (IJCSIT) International Journal of Computer Science and Information Technologies, Vol. 7 (5) , 2016, 2297-2301
Ms. Jayshri Arjun Patil, Ms. Poonam Bhagwandas Godhwani, Review of Name Entity Recognition in Marathi Language IJSART - Volume 2 Issue 6 , June 2016
A report on Text Summarization for Compressed Inverted Indexes and snippets by Mahesh Dangale CS 297 Report July 2013.
Anjali R. Deshpande, Lobo L. M. R. J., Text Summarization using Clustering Technique International Journal of Engineering Trends and Technology (IJETT) - Volume4 Issue8- August 2013
Manpreet kaur, Usvir Kaur, Comparison Between K-Mean and Hierarchical Algorithm Using Query Redirection International Journal of Advanced Research in Computer Science and Software Engineering, Volume 3, Issue 7, July 2013
Feifan Liu, Yang Liu, Exploring Correlation between ROUGE and Human Evaluation on Meeting Summaries IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING
R. Vijaya Lakshmi, Dr. S. Britto Ramesh Kumar, Literature Review: Stemming Algorithms for Indian and Non-Indian Languages International Journal of Advanced Research in Computer Science & Technology IJARCST Volume 2, Issue 3, July-Sept 2014
Pallavi Bagul, Archana Mishra, Prachi Mahajan, Medinee Kulkarni, Gauri Dhopavkar, Rule based POS tagger for Marathi TextInternational Journal of Computer Science and IT technologies, Volume 5, 2014
Pooja Pandey, Dhiraj Ahim, Sharvari Govilkar, Rule based Stemmer using Marathi WordNet for Marathi Language International Journal of Advanced Research in Computer and Communication Engineering, Volume 5, Issue 10, 2016
Rafael Ferreira, Luciano de Souza Cabral, Assessing sentence scoring techniques for extractive text summarizationExpert Systems with Applications, Elsevier 2013
TextRank: Bringing order into texts, Mihalcea, Rada., and Tarau, Paul. (2004), In Conference on empirical methods in natural language processing, Barcelona, Spain.
Rouge: A package for automatic evaluation of summaries, Lin, C. Y. In Text summarization branches out, Proceedings of the ACL-04 workshop (Vol. 8).
"Variations of the Similarity Function of TextRank for Automated Summarization", Federico Barrios, Federico Lopez, Luis Argerich, Rosita Wachenchauzer, arXiv, 2017.

Extractive Summarizer Construction Techniques : A Survey

Authors

Keywords:

Abstract

References

Downloads

Published

Issue

Section

License

How to Cite