Summarization Method and Timeline Generation of the Tweet

Authors

  • Pooja Patil  Department of Computer Engineering, RSCOE, Tathawade, Savitribai Phule Pune University, Pune, Maharashtra, India
  • Nilamvhatte  Department of Computer Engineering, RSCOE, Tathawade, Savitribai Phule Pune University, Pune, Maharashtra, India
  • Srushti Rajput  Department of Computer Engineering, RSCOE, Tathawade, Savitribai Phule Pune University, Pune, Maharashtra, India
  • Ujjwala Panhalkar  Department of Computer Engineering, RSCOE, Tathawade, Savitribai Phule Pune University, Pune, Maharashtra, India
  • K. V. Deshpande  Department of Computer Engineering, RSCOE, Tathawade, Savitribai Phule Pune University, Pune, Maharashtra, India

Keywords:

Tweet Stream, Continuous Summarization, Tweet Clustering, Summary, Timeline

Abstract

Twitter is the most popular micro blogging web site. More than millions of tweets are posted along twitter every day. Tweets contains huge amount of noisy and redundant data. It is very important to summarize the huge amount of tweets by reducing the size of tweets and removing the noise, for improving the result accuracy. The operations over flood of tweets are not an easy task. There are so many tweets are unrelated, also arrival rate of tweets is fast. To handle these problems, there is a need of efficient and strong summarization algorithm. This algorithm should be flexible with random time duration. For topic evolution system should detect sub-topic and keeps track for any changes occur with the time. To achieve all these goals, proposed system performs three types of operations on tweets, named as clustering of tweets, summarization and topic evaluation over tweeter data. Framework has component is data duplication checking using SHA1 hashing strategy. Framework used clustering procedure it uses EM clustering and compare the EM clustering algorithm with K-means clustering algorithm. After this, tweets are summarized with greedy algorithm, which is more accuracy as compared to traditional summarization algorithm. Finally, the topic is detected for generated summary. Experimental results proves that the proposed system summarize the tweets more accurately and efficiently.

References

  1. Zhenhua Wang and Ke Chen, “on summarization and timeline generation for evolutionary Tweet Streams”, IEEE Transaction [2015].
  2. D. Wen, G. Marshall, “Automatic twitter topic summarization” Computational Science and Engineering (CSE), IEEE 17th Inter- national Conference on, Chengdu, [2014].
  3. D. Gao, R. Zhang and Y. Ouyang, “Sequential summarization: a full view of twitter trending topics,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, Feb. 2014.
  4. J. Mao, X. Wang and A. Zhou, “Challenges and issues in trajectory streams clustering upon a Sliding-Window Model”, 12th Web Information System and Application Conference (WISA), [2015].
  5. M. M. Gaber, B. and S. Krishnaswamy, “Stream AR incremental and active learning with evolving sensory data for activity recognition,” IEEE 24th International Conference on Tools with Artificial Intelligence, Athens, [2012]
  6. S. Krysanova, D. M. Marutschke, and H. Ogawa, “Clustering word co- occurrences with color keywords based on twitter feeds in Japanese and German culture,” International Conference on Culture and Computing (Culture Computing), Kyoto, 2015, pp. 191-192, [2015].
  7. V. Sindhura , Y. Sandeep, “Medical data opinion retrieval on Twitter streaming data,” IEEE International Conference on, Coimbatore, 2015, pp. 1-6 [2015].
  8. W. Feng et al., “STREAMCUBE: Hierarchical Spatio Temporal Hashtag Clustering for Event Exploration Over The Twitter stream,” IEEE 31st International Conference on Data Engineering, Seoul, 2015.
  9. Y. Akamatsu, Y. Yaguchi and K. Naruse, “Visualization of Spread of Topic Words on Twitter Using Stream Graphs and Relational Graphs,” Soft Computing and Intelligent Systems (SCIS), 15th International Symposium on, Kitakyushu, 2014, pp. 761-764[2014].

Downloads

Published

2018-06-30

Issue

Section

Research Articles

How to Cite

[1]
Pooja Patil, Nilamvhatte, Srushti Rajput, Ujjwala Panhalkar, K. V. Deshpande, " Summarization Method and Timeline Generation of the Tweet, International Journal of Scientific Research in Science, Engineering and Technology(IJSRSET), Print ISSN : 2395-1990, Online ISSN : 2394-4099, Volume 4, Issue 8, pp.333-339, May-June-2018.