Information Diffusion Model in Twitter: A Systematic Literature Review
Abstract
:1. Introduction
- What are the purposes of information diffusion-related research on Twitter?
- What methods have researchers used regarding the information diffusion model on Twitter?
- What metrics from Twitter data do researchers use?
- What measures do researchers use to determine influencer rankings?
2. Related Literature Review
3. Methods
- A.
- (“information diffusion”) OR (“influence analysis”) OR (“influence maximization”) OR (“user influence”);
- B.
- (“social network”) OR (“social media”); and
- C.
- “Twitter”.
3.1. Semi-Automatic Selection
3.2. Manual Selection
- First: We examined the title, abstract, and full text from the filtered articles to find articles conducting a survey or literature review. We removed three articles in the form of surveys, namely two articles from Scopus and one article from Dimensions. Thus, in total, from this stage, we obtained 253 articles.
- Second: We examined the abstract to assess the relevance of the article to our research focus. Based on the abstracts, we discarded a total of 49 out of 253 articles, so we obtained 204 selected articles (hereinafter referred to as “Dataset 1”). Note that the original raw data returned from each digital library came in different formats. The selection results of this article originally had a different data format. Hence, we adjusted the article data for Dimensions, Science Direct, and Google Scholar in such a way that their formats were uniform to the raw file from Scopus. After restructuring all datasets into a homogeneous structure, bibliometric analysis was carried out for Dataset 1 (see Section 4).
- Third: We thoroughly read the full text and the content and discussion of the articles to further evaluate their relevance. At this point, we obtained 34 articles (henceforth referred to as “Dataset 2”), which were used further for our systematic literature review analysis.
3.3. Bibliometric Analysis
4. Results
4.1. Results from Bibliometric Analysis
4.1.1. Visualization of the Co-Authorship–Author and Co-Authorship–Country Relations
4.1.2. Visualization of Co-Occurrence–Word Relation
4.1.3. Thematic Evolution
4.2. Results from Systematic Literature Review
4.2.1. The Purpose of Research on Information Diffusion on Twitter
- Information Difference Model on Twitter—articles that focus on modeling how information diffuses or spreads on Twitter;
- Influential User on Twitter—articles that discuss how to find the most influential users on Twitter or to rank Twitter users; and
- Influence Maximization on Twitter—articles that discuss how to maximize the influence of users who share information on Twitter.
4.2.2. Methods Used in Information Diffusion Modeling on Twitter
4.2.3. Twitter Metrics
4.2.4. Measures for Determining Influencers
5. Discussion
5.1. The State-of-the-Art of Information Diffusion Application on Twitter
5.2. Research Gaps
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Ghani, N.A.; Hamid, S.; Targio Hashem, I.A.; Ahmed, E. Social media big data analytics: A survey. Comput. Hum. Behav. 2019, 101, 417–428. [Google Scholar] [CrossRef]
- Fernando, S.; Amador Díaz López, J.; Şerban, O.; Gómez-Romero, J.; Molina-Solana, M.; Guo, Y. Towards a large-scale twitter observatory for political events. Futur. Gener. Comput. Syst. 2019, 110, 976–983. [Google Scholar] [CrossRef]
- Ruz, G.A.; Henríquez, P.A.; Mascareño, A. Sentiment analysis of Twitter data during critical events through Bayesian networks classifiers. Futur. Gener. Comput. Syst. 2020, 106, 92–104. [Google Scholar] [CrossRef]
- Hoeber, O.; Hoeber, L.; El Meseery, M.; Odoh, K.; Gopi, R. Visual Twitter Analytics (Vista): Temporally changing sentiment and the discovery of emergent themes within sport event tweets. Online Inf. Rev. 2016, 40, 25–41. [Google Scholar] [CrossRef]
- Ibrahim, N.F.; Wang, X. A text analytics approach for online retailing service improvement: Evidence from Twitter. Decis. Support Syst. 2019, 121, 37–50. [Google Scholar] [CrossRef] [Green Version]
- Radianti, J.; Hiltz, S.R.; Labaka, L. An overview of public concerns during the recovery period after a major earthquake: Nepal twitter analysis. In Proceedings of the 49th Hawaii International Conference on System Sciences (HICSS), Koloa, HI, USA, 5–8 January 2016; pp. 136–145. [Google Scholar]
- Zahra, K.; Imran, M.; Ostermann, F.O. Automatic identification of eyewitness messages on twitter during disasters. Inf. Processing Manag. 2020, 57, 102107. [Google Scholar] [CrossRef]
- Sakaki, T.; Okazaki, M.; Matsuo, Y. Earthquake shakes Twitter users: Real-time event detection by social sensors. In Proceedings of the 19th International World Wide Web Conference, Raleigh, NC, USA, 26–30 April 2010; pp. 851–860. [Google Scholar]
- Bolzern, P.; Colaneri, P.; De Nicolao, G. Opinion influence and evolution in social networks: A Markovian agents model. Automatica 2019, 100, 219–230. [Google Scholar] [CrossRef] [Green Version]
- Barnaghi, P.; Ghaffari, P.; Breslin, J.G. Opinion Mining and Sentiment Polarity on Twitter and Correlation between Events and Sentiment. In Proceedings of the 2016 IEEE Second International Conference on Big Data Computing Service and Applications (BigDataService), Oxford, UK, 29 March–1 April 2016; pp. 52–57. [Google Scholar]
- Li, J.; Peng, W.; Li, T.; Sun, T.; Li, Q.; Xu, J. Social network user influence sense-making and dynamics prediction. Expert Syst. Appl. 2014, 41, 5115–5124. [Google Scholar] [CrossRef]
- Varshney, D.; Kumar, S.; Gupta, V. Predicting information diffusion probabilities in social networks: A Bayesian networks based approach. Knowl. Based Syst. 2017, 133, 66–76. [Google Scholar] [CrossRef]
- Mostafa, M.M. Information Diffusion in Halal Food Social Media: A Social Network Approach. J. Int. Consum. Mark. 2021, 33, 471–491. [Google Scholar] [CrossRef]
- Kumar, S.; Saini, M.; Goel, M.; Aggarwal, N. Modeling Information Diffusion in Online Social Networks Using SEI Epidemic Model. Procedia Comput. Sci. 2020, 171, 672–678. [Google Scholar] [CrossRef]
- Kushwaha, A.K.; Kar, A.K.; Ilavarasan, P.V. Predicting information diffusion on twitter a deep learning neural network model using custom weighted word features. Responsib. Des. Implement. Use Inf. Commun. Technol. 2020, 12066, 456. [Google Scholar]
- Peng, S.; Zhou, Y.; Cao, L.; Yu, S.; Niu, J.; Jia, W. Influence analysis in social networks: A survey. J. Netw. Comput. Appl. 2018, 106, 17–32. [Google Scholar] [CrossRef]
- Cha, M.; Haddai, H.; Benevenuto, F.; Gummadi, K.P. Measuring user influence in Twitter: The million follower fallacy. In Proceedings of the International AAAI conference on web and social media, Washington, DC, USA, 23–26 May 2010; Volume 4, pp. 10–17. [Google Scholar]
- Salavati, C.; Abdollahpouri, A. Identifying influential nodes based on ant colony optimization to maximize profit in social networks. Swarm Evol. Comput. 2019, 51, 100614. [Google Scholar] [CrossRef]
- Li, D.; Wang, W.; Jin, C.; Ma, J.; Sun, X.; Xu, Z.; Li, S.; Liu, J. User recommendation for promoting information diffusion in social networks. Phys. Stat. Mech. Its Appl. 2019, 534, 121536. [Google Scholar] [CrossRef]
- Rezaie, B.; Zahedi, M.; Mashayekhi, H. Measuring time-sensitive user influence in Twitter. Knowl. Inf. Syst. 2020, 62, 3481–3508. [Google Scholar] [CrossRef]
- Oo, M.M.; Lwin, M.T. Detecting Influential Users in a Trending Topic Community Using Link Analysis Approach. Int. J. Intell. Eng. Syst. 2020, 13, 178–188. [Google Scholar] [CrossRef]
- Gong, M.; Yan, J.; Shen, B.; Ma, L.; Cai, Q. Influence maximization in social networks based on discrete particle swarm optimization. Inf. Sci. 2016, 367, 600–614. [Google Scholar] [CrossRef]
- Jendoubi, S.; Martin, A.; Liétard, L.; Ben Hadji, H.; Ben Yaghlane, B. Two evidential data based models for influence maximization in Twitter. Knowl. Based Syst. 2017, 121, 58–70. [Google Scholar] [CrossRef] [Green Version]
- Felfli, Z.; George, R.; Shujaee, K.; Kerwat, M. Potential-driven model for influence maximization in social networks. IEEE Access 2020, 8, 189786–189795. [Google Scholar] [CrossRef]
- Arora, A.; Bansal, S.; Kandpal, C.; Aswani, R.; Dwivedi, Y. Measuring social media influencer index-insights from facebook, Twitter and Instagram. J. Retail. Consum. Serv. 2019, 49, 86–101. [Google Scholar] [CrossRef]
- Deborah, A.; Michela, A.; Anna, C. How to quantify social media in fl uencers: An empirical application at the Teatro alla Scala. Heliyon 2019, 5, e01677. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Kumar, S.; Saini, M.; Goel, M.; Panda, B.S. Modeling information diffusion in online social networks using a modified forest-fire model. J. Intell. Inf. Syst. 2020, 56, 355–377. [Google Scholar] [CrossRef] [PubMed]
- Kakar, S.; Mehrotra, M. A review of critical research areas under information diffusion in social networks. Int. J. Adv. Comput. Sci. Appl. 2020, 11, 383–396. [Google Scholar] [CrossRef]
- Razaque, A.; Rizvi, S.; Almiani, M.; Al Rahayfeh, A. State-of-art review of information diffusion models and their impact on social network vulnerabilities. J. King Saud Univ. Comput. Inf. Sci. 2019, 34, 1275–1294. [Google Scholar] [CrossRef]
- Alamsyah, A.; Rahardjo, B. Social Network Analysis Taxonomy Based on Graph Representation. arXiv 2021, arXiv:2102.08888. [Google Scholar]
- Hamzah, M. A Taxonomy of Twitter Data Analytics Techniques. In Proceedings of the 32nd IBIMA Conference, Seville, Spain, 15–16 November 2018. [Google Scholar]
- Li, M.; Wang, X.; Gao, K.; Zhang, S. A survey on information diffusion in online social networks: Models and methods. Information 2017, 8, 118. [Google Scholar] [CrossRef] [Green Version]
- Singh, S.S.; Singh, K.; Kumar, A.; Shakya, H.K.; Biswas, B. A Survey on Information Diffusion Models in Social Networks. In International Conference on Advanced Informatics for Computing Research; Springer: Singapore, 2018; Volume 956, pp. 426–439. [Google Scholar]
- Firdaus, S.N.; Ding, C.; Sadeghian, A. Retweet: A popular information diffusion mechanism—A survey paper. Online Soc. Netw. Media 2018, 6, 26–40. [Google Scholar] [CrossRef]
- Riquelme, F.; González-Cantergiani, P. Measuring user influence on Twitter: A survey. Inf. Processing Manag. 2016, 52, 949–975. [Google Scholar] [CrossRef] [Green Version]
- Kitchenham, B. Procedures for Performing Systematic Reviews; Joint Technical Report; Keele University: Staffordshire, UK, 2004. [Google Scholar]
- Silva, R.L.S.; Neiva, F.W. Systematic Literature Review in Computer Science—A Practical Guide; Technical Report; Federal University of Juiz de Fora: Juiz de Fora, Brazil, 2016. [Google Scholar]
- Ellegaard, O.; Wallin, J.A. The bibliometric analysis of scholarly production: How great is the impact? Scientometrics 2015, 105, 1809–1831. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- van Eck, N.J.; Waltman, L. VOSviewer Manual; Univeristeit Leiden: Leiden, The Netherlands, 2013; Volume 1, pp. 1–53. [Google Scholar]
- Aria, M.; Cuccurullo, C. Bibliometrix: An R-tool for comprehensive science mapping analysis. J. Informetr. 2017, 11, 959–975. [Google Scholar] [CrossRef]
- Callon, M.; Courtial, J.P.; Laville, F. Co-word analysis as a tool for describing the network of interactions between basic and technological research: The case of polymer chemsitry. Scientometrics 1991, 22, 155–205. [Google Scholar] [CrossRef]
- Muhlmeyer, M.; Agarwal, S.; Huang, J. Modeling Social Contagion and Information Diffusion in Complex Socio-Technical Systems. IEEE Syst. J. 2020, 14, 5187–5198. [Google Scholar] [CrossRef]
- Kim, Y.; Seo, J. Detection of Rapidly Spreading Hashtags via Social Networks. IEEE Access 2020, 8, 39847–39860. [Google Scholar] [CrossRef]
- Zheng, Z.; Yang, H.; Fu, Y.; Fu, D.; Podobnik, B.; Stanley, H.E. Factors influencing message dissemination through social media. Phys. Rev. E 2018, 97, 062306. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Foroozani, A.; Ebrahimi, M. Anomalous information diffusion in social networks: Twitter and Digg. Expert Syst. Appl. 2019, 134, 249–266. [Google Scholar] [CrossRef]
- Kawamoto, T. A stochastic model of tweet diffusion on the Twitter network. Phys. Stat. Mech. Its Appl. 2013, 392, 3470–3475. [Google Scholar] [CrossRef] [Green Version]
- Yoo, E.; Rand, W.; Eftekhar, M.; Rabinovich, E. Evaluating information diffusion speed and its determinants in social media networks during humanitarian crises. J. Oper. 2016, 45, 123–133. [Google Scholar] [CrossRef]
- Stieglitz, S.; Dang-Xuan, L. Emotions and information diffusion in social media—Sentiment of microblogs and sharing behavior. J. Manag. Inf. 2013, 29, 217–247. [Google Scholar] [CrossRef]
- Kwon, J.; Han, I.; Kim, B. Effects of source influence and peer referrals on information diffusion in Twitter. Ind. Manag. Data Syst. 2017, 117, 896–909. [Google Scholar] [CrossRef]
- Agarwal, S.; Mehta, S. Effective influence estimation in twitter using temporal, profile, structural and interaction characteristics. Inf. Process. Manag. 2020, 57, 102321. [Google Scholar] [CrossRef]
- Jiang, C.; Chen, Y.; Liu, K.J.R. Evolutionary dynamics of information diffusion over social networks. IEEE Trans. Signal Process. 2014, 62, 4573–4586. [Google Scholar] [CrossRef] [Green Version]
- Shuai, X.; Ding, Y.; Busemeyer, J.; Chen, S.; Sun, Y.; Tang, J. Modeling indirect influence on Twitter. Int. J. Semant. Web Inf. Syst. 2012, 8, 20–36. [Google Scholar] [CrossRef] [Green Version]
- Ho, T.K.T.; Bui, Q.V.; Bui, M. Information Diffusion on Complex Networks: A Novel Approach Based on Topic Modeling and Pretopology Theory. Vietnam J. Comput. Sci. 2019, 6, 285–309. [Google Scholar] [CrossRef] [Green Version]
- Tidke, B.; Mehta, R.; Dhanani, J. Consensus-based aggregation for identification and ranking of top-k influential nodes. Neural Comput. Appl. 2020, 32, 10275–10301. [Google Scholar] [CrossRef]
- Tidke, B.; Mehta, R.; Dhanani, J. Multimodal ensemble approach to identify and rank top-k influential nodes of scholarly literature using Twitter network. J. Inf. Sci. 2020, 46, 437–458. [Google Scholar] [CrossRef]
- Zhang, L. Product information diffusion in a social network. Electron. Commer. Res. 2020, 20, 3–19. [Google Scholar] [CrossRef]
- Bhowmick, A. Temporal Sequence of Retweets Help to Detect Influential Nodes in Social Networks. IEEE Trans. Comput. Soc. Syst. 2019, 6, 441–455. [Google Scholar] [CrossRef]
- Alp, Z.Z.; Öğüdücü, Ş.G. Influence Factorization for identifying authorities in Twitter. Knowl. Based Syst. 2019, 163, 944–954. [Google Scholar] [CrossRef]
- Alp, Z.Z. Identifying topical influencers on twitter based on user behavior and network topology. Knowl. Based Syst. 2018, 141, 211–221. [Google Scholar]
- Qasem, Z.; Jansen, M.; Hecking, T.; Hoppe, H.U. Using attractiveness model for actors ranking in social media networks. Comput. Soc. Networks 2017, 4, 3. [Google Scholar] [CrossRef] [Green Version]
- Hamzehei, A.; Jiang, S.; Koutra, D.; Wong, R.; Chen, F. Topic-based social influence measurement for social networks. Australas. J. Inf. Syst. 2017, 21. [Google Scholar] [CrossRef] [Green Version]
- Simmie, D.; Vigliotti, M.G.; Hankin, C. Ranking twitter influence by combining network centrality and influence observables in an evolutionary model. J. Complex Netw. 2014, 2, 495–517. [Google Scholar] [CrossRef]
- Mittal, D.; Suthar, P.; Patil, M.; Pranaya, P.G.S.; Rana, D.P.; Tidke, B. Social Network Influencer Rank Recommender Using Diverse Features from Topical Graph. Procedia Comput. Sci. 2020, 167, 1861–1871. [Google Scholar] [CrossRef]
- Kanavos, A.; Georgiou, A.; Makris, C. Estimating Twitter Influential Users by Using Cluster-Based Fusion Methods. Int. J. Artif. Intell. Tools 2019, 28, 1960010. [Google Scholar] [CrossRef]
- Sankar, C.P.; Asharaf, S.; Kumar, K.S. Learning from bees: An approach for influence maximization on viral campaigns. PLoS ONE 2016, 11, e0168125. [Google Scholar] [CrossRef] [Green Version]
- Kim, H.; Beznosov, K.; Yoneki, E. A study on the influential neighbors to maximize information diffusion in online social networks. Comput. Soc. Netw. 2015, 2, 3. [Google Scholar] [CrossRef] [Green Version]
Author | SLR? | Search Strategy? | Data Extraction? | Information Diffusion Model? | Influence Analysis? | Metrics and Measures? | Twitter/SN? | Bibliometric Analysis? | Article Time Span |
---|---|---|---|---|---|---|---|---|---|
[28] | × | √ | √ | √ | √ | × | SN | × | 2000–2019 |
[29] | × | × | × | √ | × | × | SN | × | 2002–2017 |
[30] | × | × | × | × | × | √ | SN | × | 1998–2012 |
[31] | × | √ | × | × | × | × | × | 2011–2016 | |
[32] | × | × | × | √ | √ | × | SN | × | 2001–2017 |
[33] | × | × | × | √ | √ | √ | SN | × | 1978–2012 |
[34] | × | × | × | √ | × | √ | × | 2001–2015 | |
[35] | × | × | × | × | √ | √ | × | 1972–2015 | |
Our article | √ | √ | √ | √ | √ | √ | √ | 2000–2021 |
Keywords | Type | Scopus | Science Direct | Dimensions | Google Scholar |
---|---|---|---|---|---|
Keyword 1 | A | 2675 | 850 | 2950 | 5950 |
Keyword 2 | A AND B | 1211 | 367 | 1090 | 110 |
Keyword 3 | A AND B AND C | 199 | 50 | 172 | 3 |
Database | Data Keyword 3 | Semi-Automatic | Manual Selection | ||||||
---|---|---|---|---|---|---|---|---|---|
Duplicate | Survey | Abstract | Full Text | ||||||
Excluded | Included | Excluded | Included | Excluded | Included | Excluded | Included | ||
Scopus | 199 | 0 | 199 | 2 | 197 | 29 | 168 | 142 | 26 |
Science Direct | 50 | 45 | 5 | 0 | 5 | 2 | 3 | 3 | 0 |
Dimensions | 172 | 122 | 50 | 1 | 49 | 17 | 32 | 24 | 8 |
Google Scholar | 3 | 1 | 2 | 0 | 2 | 1 | 1 | 1 | 0 |
Total | 424 | 168 | 256 | 3 | 253 | 49 | 204 | 170 | 34 |
# | Author | Methods | Metrics |
---|---|---|---|
1 | [42] | Ignorant–spreader–counterspreader–recovered (ISCR), Ignorant–spreader–spreader–recovered–recovered (ISSRR), Stochastic ISI Information Model, Stochastic ISR Information Model | - |
2 | [50] | Time decay features cascade model (TDF-C), Time decay features cascade threshold model (TDF-CT) | retweet, quote, reply, like |
3 | [43] | Expectation Maximation-Independent Propagation Model with Susceptible–Infected State (EM-IPSI) | retweet |
4 | [45] | Discrete time random walk and Continuous time random walk (DTRW-CTRW) | mention, retweet |
5 | [44] | SIR model | retweet |
6 | [12] | Bayesian network | tweet, retweet |
7 | [47] | Regression analysis | retweet |
8 | [11] | Homogeneous continuous time Markov process (CTMP) | retweet |
9 | [51] | Evolutionary Game Theory (EGT) | tweet with a specific hashtag |
10 | [46] | A stochastic model | retweet |
11 | [48] | Regression | retweet |
12 | [52] | Quantum q-Attention Model | retweet |
13 | [27] | Modified forest-fire model based on mentioned, similarity score, user activity, topic significance | retweet |
14 | [14] | Susceptible–exposed–infected (SEI) | tweet, retweet, reply, mention |
15 | [53] | Textual-Homo-IC, Textual-Homo-PCM | follow |
16 | [49] | Poisson regression model | retweet, quote |
# | Author | Methods | Metrics | Measures |
---|---|---|---|---|
1 | [20] | Analytic hierarchy process (AHP) | tweet–retweet | analytic hierarchy process (AHP) algorithm |
2 | [54] | Average Consensus Ranking Aggregation (ACRA) and Weighted Average Consensus Ranking Aggregation (WACRA) | retweet, mention, follow | degree centrality, closeness, betweenness, eigenvector, PageRank |
3 | [55] | Weighted multimodal ensemble average influence (WMMEAI) for rank top-k | tweet, follower | degree centrality, closeness, betweenness, eigenvector |
4 | [56] | PageRank and betweenness | retweet, quote, favorite | PageRank, betweenness |
5 | [21] | PageRank | retweet, mention, reply | PageRank |
6 | [57] | Smart Inf (stands for Smart Influencer) algorithm; Classical Metrics to Compute Node Influence PageRank centrality (PR), eigenvector centrality (EV), and betweenness centrality (BW) | Retweet | PageRank, eigen vector, betweenness centrality |
7 | [58] | Influence Factorization | retweet | PageRank |
8 | [59] | Personalized PageRank (PPR) | retweet | PPR |
9 | [60] | T and HT measure | retweet | T and HT |
10 | [61] | Topic-based Social Influence Measure (TSIM) | retweet, mention | PageRank, network centrality |
11 | [62] | Hidden Markov Model (HMM) | retweet, reply, mention | buzz rank, reach buzz rank |
12 | [13] | The Clauset–Newman–Moore community detection algorithm, The Latent Dirichlet Allocation (LDA) method | retweet | reciprocity degree, closeness, betweenness, diameter, modularity, transitivity |
13 | [63] | Aggregation Consensus Rank Algorithm (ACRA) | retweet, mention | indegree, closeness centrality, betweenness centrality, eigenvector centrality |
14 | [64] | Cluster-based fusion techniques | retweet, mention | retweet impact, mention impact, signal strength |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Firdaniza, F.; Ruchjana, B.N.; Chaerani, D.; Radianti, J. Information Diffusion Model in Twitter: A Systematic Literature Review. Information 2022, 13, 13. https://doi.org/10.3390/info13010013
Firdaniza F, Ruchjana BN, Chaerani D, Radianti J. Information Diffusion Model in Twitter: A Systematic Literature Review. Information. 2022; 13(1):13. https://doi.org/10.3390/info13010013
Chicago/Turabian StyleFirdaniza, Firdaniza, Budi Nurani Ruchjana, Diah Chaerani, and Jaziar Radianti. 2022. "Information Diffusion Model in Twitter: A Systematic Literature Review" Information 13, no. 1: 13. https://doi.org/10.3390/info13010013