Review on an Enhanced LDA Topic Model Approach for Event Extraction from Twitter

Authors

  • Zarana Patel  PG Student, Department of Computer Engineering, Government Engineering College Gandhinagar, Gandhinagar, Gujarat, India
  • Jitendra Dhobi  Associate Professor, Department of Computer Engineering, Government Engineering College Gandhinagar, Gandhinagar, Gujarat, India

Keywords:

Topic modeling, Event Extraction, Twitter, LDA, Data mining

Abstract

Topic models are powerful tools to identify latent text patterns in the content. They are applied in a wide range of areas including event extraction from Twitter. Twitter, as a popular micro blogging service, has become a new information channel for users to receive and exchange the important information on current events. Tweets recently gain a lot of importance due to its ability of produce information rapidly. Tweets are commonly related to some events. In this paper provide you a review on event extraction from twitter using topic modeling. Based on the study of the researchers LDA is the best topic model for event extraction. Though applying traditional LDA topic model directly on tweets posses two challenges: 1) Data scarceness problem due to the nature of short text length of the tweets. 2) Generated summaries contain words that are somewhat general and independent to the topic that is failed to understand the semantic of twitter data. Event Extraction methods present in this literature address this problem and classify different approach and discuss commonly used features.

References

  1. Dharini Ramachandran, Parvathi R, "Event detection from twitter - a survey Article information?: About Emerald www.emeraldinsight.com Event detection from twitter - a survey," 2018.
  2. Jelodar, Hamed, et al. "Latent Dirchlet Allocation (LDA) and Topic modeling: models, applications, a survey.", arXiv,2017.
  3. D M. Blei, A. Y. Ng, and M. I. Jordan, "Latent Dirichlet Allocation," J. Mach. Learn. Res., vol. 3, no. 3, pp. 993-1022, 2003.
  4. Yang, Shih-Feng, and Julia Taylor Rayz. "An event detection approach based on Twitter hashtags." arXiv preprint arXiv:1804.11243,2018.
  5. F Zarrinkalam and E. Bagheri, "Event identification in social networks," Encycl. with Semant. Comput. Robot. Intell., vol. 01, no. 01, p. 1630002, 2017.
  6. T Sakaki, M. Okazaki, and Y. Matsuo, "Earthquake Shakes Twitter Users: Real-time Event Detection by Social Sensors," Proc. 19th Int. Conf. World Wide Web, pp. 851-860, 2010.
  7. S B. Kaleel and A. Abhari, "Cluster-discovery of Twitter messages for event detection and trending," J. Comput. Sci., vol. 6, pp. 47-57, 2015.
  8. D Godfrey, C. Johns, C. Sadek, and M. L. Aug, "A Case Study in Text Mining?: Interpreting Twitter Data From World Cup Tweets," vol. 5, pp. 1-11.
  9. R Musters, "Topic detection in microblogs using big data and neural networks," Univ. Groningen, no. June, 2017.
  10. G. Ifrim, B. Shi, and I. Brigadir, "Event detection in Twitter using aggressive filtering and hierarchical tweet clustering," CEUR Workshop Proc., vol. 1150, pp. 33-40, 2014.
  11. F. Atefeh and W. Khreich, "A survey of techniques for event detection in Twitter," Comput. Intell., vol. 31, no. 1, pp. 133-164, 2015.
  12. A. Karandikar, "Clustering short status messages?: A topic model based approach," Work, p. 55, 2010.
  13. T. Hofmann, "Probabilistic latent semantic analysis," UAI99 Proc. Fifteenth Conf. Uncertain. Artif. Intell., pp. 289-296, 1999.
  14. K. Krishnamurthi, "Impact of Topic Modelling Methods and Text Classification Techniques in Text Mining?: a Survey," no. 3, pp. 72-77, 2017.
  15. J. Li, Z. Tai, R. Zhang, W. Yu, and L. Liu, "Online Bursty Event Detection from Microblog," 2014.
  16. X. Sun, Y. Wu, L. Liu, and J. Panneerselvam, "Efficient Event Detection in Social Media Data Streams," 2015 IEEE Int. Conf. Comput. Inf. Technol. Ubiquitous Comput. Commun. Dependable, Auton. Secur. Comput. Pervasive Intell. Comput., pp. 1711-1717, 2015.
  17. L. Shi, L. Liu, X. Wu, L. Jiang, and Y. Sun, "Event Detection and Key Posts Discovering in Social Media Data Streams," IEEE Access, vol. 3536, no. c, pp. 1-1, 2017.
  18. L. Shi, L. Liu, Y. Wu, L. Jiang, and J. Hardy, "Event Detection and User Interest Discovering in Social Media Data Streams," IEEE Access, vol. 3536, no. c, pp. 1-1, 2017.
  19. X. Yan, J. Guo, Y. Lan, and X. Cheng, "A biterm topic model for short texts," WWW 13 Proc. 22nd Int. Conf. World Wide Web, pp. 1445-1456, 2013.
  20. W. Li, Y. Feng, D. Li, and Z. Yu, "Micro-blog topic detection method based on BTM topic model and K-means clustering algorithm," Autom. Control Comput. Sci., vol. 50, no. 4, pp. 271-277, 2016.
  21. A. H. Hossny, T. Moschuo, G. Osborne, L. Mitchell, and N. Lothian, "Enhancing keyword correlation for event detection in social networks using SVD and k-means: Twitter case study," Soc. Netw. Anal. Min., vol. 8, no. 1, p. 0, 2018.
  22. X. Yan, J. Guo, S. Liu, X. Cheng, and Y. Wang, "Learning Topics in Short Texts by Non-negative Matrix Factorization on Term Correlation Matrix," Proc. 2013 SIAM Int. Conf. Data Min., pp. 749-757, 2013.
  23. L. Hong and B. D. Davison, "Empirical Study of Topic Modeling in Twitter," pp. 80-88,2010.
  24. R. Mehrotra, S. Sanner, W. Buntine, and L. Xie, "Improving LDA topic models for microblogs via tweet pooling and automatic labeling," Proc. 36th Int. ACM SIGIR Conf. Res. Dev. Inf. Retr. - SIGIR 13, p. 889, 2013.
  25. D. Alvarez-Melis and M. Saveski, "Topic Modeling in Twitter: Aggregating Tweets by Conversations," $Icwsm16, no. Icwsm, pp. 519-522, 2016.
  26. P. Suri and N. R. Roy, "Comparison between LDA & NMF for event-detection from large text stream data," 3rd IEEE Int. Conf., pp. 1-5, 2017.
  27. W. X. Zhao, J. Jiang, J. Weng, J. He, and E. Lim, "Comparing Twitter and Traditional Media Using," pp. 338-349, 2011.
  28. M. Annamalai and S. Farah Nasehah Mukhlis, "Content quality of clustered latent dirichlet allocation short summaries," Lect. Notes Comput. Sci. (including Subser. Lect. Notes Artif. Intell. Lect. Notes Bioinformatics), vol. 8870, pp. 494-504, 2014.
  29. M. Hajjem and C. Latiri, "ScienceDirect Combining IR IR and and LDA LDA Topic Topic Modeling Modeling for for Filtering Filtering Microblogs Microblogs," Procedia Comput. Sci., vol. 112, pp. 761-770, 2017.
  30. C. Xing, Y. Wang, J. Liu, Y. Huang, and W. Ma, "Hashtag-Based Sub-Event Discovery Using Mutually Generative LDA in Twitter," Aaai, pp. 2666-2672, 2016.
  31. M. Sokolova et al., "Topic Modelling and Event Identification from Twitter Textual Data," 2016.
  32. M. Gupta and P. Gupta, "Research and implementation of event extraction from twitter using LDA and scoring function," Int. J. Inf. Technol., 2018.
  33. A. Weiler, M. Grossniklaus, and M. H. Scholl, "Evaluation Measures for Event Detection Techniques on Twitter Data Streams," pp. 108-119, 2015.
  34. Hidayatullah, Ahmad Fathan, et al. "Twitter Topic Modeling on Football News." 3rd International Conference on Computer and Communication Systems (ICCCS). IEEE, 2018.

Downloads

Published

2018-10-30

Issue

Section

Research Articles

How to Cite

[1]
Zarana Patel, Jitendra Dhobi, " Review on an Enhanced LDA Topic Model Approach for Event Extraction from Twitter , IInternational Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 3, Issue 7, pp.352-358, September-October-2018.