Extractive Summarizer Construction Techniques : A Survey

Authors

  • Vaishali V. Sarwadnya  Pune Institute of Computer Technology, Pune, Maharashtra, India
  • Sheetal. S. Sonawane  Pune Institute of Computer Technology, Pune, Maharashtra, India

Keywords:

Extractive Summarizer, Feature Extraction, Sentence Scoring, Marathi

Abstract

Manual summarization of large documents of texts is tedious and error prone. Also, the results in such kind of summarization may lead to different results for a particular document. Thus, Automatic text summarization has become important due to the tremendous growth of information and data. It chooses the most informative part of text and forms summaries that reveal the main purpose of the given document. It yields summary produced by summarization system which allows readers to comprehend the content of document instead for reading each and every individual document. So, the overall intention of Text Summarizer is to provide the meaning of text in less words and sentences. Summarization can be categorized as: Abstractive summarization and Extractive summarization. This case study is based on an extractive concept implemented on the studied models. Numerous automatic text summarization systems are handy today for English and other foreign languages. But when it comes to Indian languages, we observe inadequate number of automatic summarizers. Evaluation can be done using quantitative or qualitative approach. This paper describes review of techniques used while constructing extractive summarizers and an approach to construct extractive summarizer for Marathi.

References

  1. Virat V. Giri, Dr.M.M. Math and Dr. U. P. Kulkarni, A Survey of Automatic Text Summarization System for Different Regional Languages in India Bonfring International Journal of Software Engineering and Soft Computing, Vol. 6, Special Issue, October 2016
  2. Sheetal Shimpikar and Sharvari Govilkar, A Survey of Text Summarization Techniques for Different Regional Languages in India, International Journal of Computer Applications, Vol. 165, No. 11, May 2017
  3. Sunitha C, Dr. A Jaya and Amal Ganesh, A Survey of Abstractive Summarization Techniques in Indian Languages, 2016
  4. Hamzah Noori Fejer and Nazlia Omar, Automatic Multi-Document Arabic Text Summarization Using Clustering and Keyphrase Extraction ICIMU IEEE 2014 International Conference,978-1-4799-5423-0.
  5. Deepali K. Gaikwad, Deepali Sawane and C. Namrata Mahender, Rule Based Question Generation for Marathi Text Summarization using Rule Based StemmerIOSR Journal of Computer Engineering (IOSR-JCE), eISSN: 2278-0661. 2015 6Mudassar Majgaonkar and Tanveer Siddiqui, Discovering suffixes: A case study for Marathi Language (IJCSE) International Journal on Computer Science and Engineering Vol. 02, No. 08, 2010, 2716-2720
  6. Aishwarya Sahani, Kaustubh Sarang, Sushmita Umredkar, and Mihir Patil, Automatic Text Categorization of Marathi Language Documents (IJCSIT) International Journal of Computer Science and Information Technologies, Vol. 7 (5) , 2016, 2297-2301
  7. Ms. Jayshri Arjun Patil, Ms. Poonam Bhagwandas Godhwani, Review of Name Entity Recognition in Marathi Language IJSART - Volume 2 Issue 6 , June 2016
  8. A report on Text Summarization for Compressed Inverted Indexes and snippets by Mahesh Dangale CS 297 Report July 2013.
  9. Anjali R. Deshpande, Lobo L. M. R. J., Text Summarization using Clustering Technique International Journal of Engineering Trends and Technology (IJETT) - Volume4 Issue8- August 2013
  10. Manpreet kaur, Usvir Kaur, Comparison Between K-Mean and Hierarchical Algorithm Using Query Redirection International Journal of Advanced Research in Computer Science and Software Engineering, Volume 3, Issue 7, July 2013
  11. Feifan Liu, Yang Liu, Exploring Correlation between ROUGE and Human Evaluation on Meeting Summaries IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING
  12. R. Vijaya Lakshmi, Dr. S. Britto Ramesh Kumar, Literature Review: Stemming Algorithms for Indian and Non-Indian Languages International Journal of Advanced Research in Computer Science & Technology IJARCST Volume 2, Issue 3, July-Sept 2014
  13. Pallavi Bagul, Archana Mishra, Prachi Mahajan, Medinee Kulkarni, Gauri Dhopavkar, Rule based POS tagger for Marathi TextInternational Journal of Computer Science and IT technologies, Volume 5, 2014
  14. Pooja Pandey, Dhiraj Ahim, Sharvari Govilkar, Rule based Stemmer using Marathi WordNet for Marathi Language International Journal of Advanced Research in Computer and Communication Engineering, Volume 5, Issue 10, 2016
  15. Rafael Ferreira, Luciano de Souza Cabral, Assessing sentence scoring techniques for extractive text summarizationExpert Systems with Applications, Elsevier 2013
  16. TextRank: Bringing order into texts, Mihalcea, Rada., and Tarau, Paul. (2004), In Conference on empirical methods in natural language processing, Barcelona, Spain.
  17. Rouge: A package for automatic evaluation of summaries, Lin, C. Y. In Text summarization branches out, Proceedings of the ACL-04 workshop (Vol. 8).
  18. "Variations of the Similarity Function of TextRank for Automated Summarization", Federico Barrios, Federico Lopez, Luis Argerich, Rosita Wachenchauzer, arXiv, 2017.

Downloads

Published

2018-06-30

Issue

Section

Research Articles

How to Cite

[1]
Vaishali V. Sarwadnya, Sheetal. S. Sonawane, " Extractive Summarizer Construction Techniques : A Survey, IInternational Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 3, Issue 3, pp.2058-2066, March-April-2018.