Multi-Topic Tweet Stream Summarization Based on Tweet Vector Clustering

Authors

  • Yashashri Pahade  BE Scholar, Department of Computer Science & Engineering Priyadarshini Bhagwati College of Engineering, Nagpur, Maharashtra, India
  • Prajakta Bhagat  BE Scholar, Department of Computer Science & Engineering Priyadarshini Bhagwati College of Engineering, Nagpur, Maharashtra, India
  • Payal Bele  BE Scholar, Department of Computer Science & Engineering Priyadarshini Bhagwati College of Engineering, Nagpur, Maharashtra, India
  • Rohini Talokar  BE Scholar, Department of Computer Science & Engineering Priyadarshini Bhagwati College of Engineering, Nagpur, Maharashtra, India
  • Prof. Dinesh V. Jamthe  Assistant Professor, Department of Computer Science & Engineering, Priyadarshini Bhagwati College of Engineering, Nagpur, Maharashtra, India

Keywords:

Tweet Stream, Continuous Summarization, Tweet Clustering, Summary, Timeline

Abstract

Immense volume of short messages that is tweets are being shared among various clients and information on long range informal communication locales and microblogging destinations, for example, Twitter, Facebook and so forth. Twitter gets more than 400 million tweets for every day. Constant examination is extremely troublesome and testing undertaking on such gigantic information likewise questioning and recovery of information is additionally troublesome. Such a huge number of tweets contain colossal measure of commotion and repetition. Existing frameworks were generally chipped away at the static and the constrained information. The different existing frameworks were proposed to address these issues and furthermore they gave some arrangement. Summarization is the way toward involving a content document in such way that short summary produced by using the essential keywords of the first document. There is need of dynamic way to deal with condense information delivered by Twitter feeds. This paper proposes the novel method, which produce the significant substance based summery inside less measure of time. Especially, in the proposed framework multi-subject summarization is performed on the online dataset which thusly require the less measure of time when contrasted with the other existing framework. So time productivity is improved by using the proposed framework.

References

  1. Prof R. Mihalcea and P. Tarau, "TextRank: Bringing order into texts," in EMNLP. Barcelona: ACL, 2004, pp. 404–411.
  2. David Inouye and Jugal K. Kalita, "Comparing Twitter Summarization Algorithms for Multiple Post Summaries", IEEE Trans. Knowl. Data Eng., 23(8):1200–1214, 2011.
  3. R. Yan, X. Wan, J. Otterbacher, L. Kong, X. Li, and Y. Zhang, "Evolutionary timeline summarization: A balanced optimization framework via iterative substitution," in Proc. 34th Int. ACM SIGIR Conf. Res. Develop. Inf. Retrieval, 2011, pp. 745–754.
  4. Mitsumasa Kubo, Ryohei Sasano, Hiroya Takamura, and Manabu Okumura, "Generating Live Sports Updates from Twitter by Finding Good Reporters," in IEEE,2013.
  5. T. Zhang, R. Ramakrishnan, and M. Livny, "BIRCH: An efficient data clustering method for very large databases," in Proc. ACM SIGMOD Int. sConf. Manage. Data, 1996, pp. 103–114.
  6. Arkaitz Zubiaga, Damiano Spina, Enrique Amigó and Julio Gonzalo,"Towards Real-Time Summarization of Scheduled Events from Twitter Streams", in Proc. 23rd ACM Conf. Hypertext Social Media,2012.
  7. C. Shen, F. Liu, F. Weng, and T. Li, "A participant-based approach for event summarization using twitter streams," in Proc. Human Lang. Technol. Annu. Conf. North Amer. Chapter Assoc. Comput. Linguistics, 2013, pp. 1152–1162.
  8. G. Erkan and D. Radev, "Lexrank: graph-based centrality as salience in text summarization," Journal of Artificial Intelligence Research, vol. 22, pp. 457–480, 2004.
  9. Zhenhua Wang, Lidan Shou, Ke Chen, "On Summarization and Timeline Generation for Evolutionary Tweet Streams", IEEE Transaction On Knowledge And Data Engineering, Vol. 27, No. 5, May 2015.

Downloads

Published

2018-04-30

Issue

Section

Research Articles

How to Cite

[1]
Yashashri Pahade, Prajakta Bhagat, Payal Bele, Rohini Talokar, Prof. Dinesh V. Jamthe, " Multi-Topic Tweet Stream Summarization Based on Tweet Vector Clustering, IInternational Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 3, Issue 4, pp.50-56, March-April-2018.