Journal of Data Science logo


Login Register

  1. Home
  2. Issues
  3. Volume 21, Issue 3 (2023): Special Issue: Advances in Network Data Science
  4. Common Growth Patterns for Regional Soci ...

Journal of Data Science

Submit your article Information
  • Article info
  • Related articles
  • More
    Article info Related articles

Common Growth Patterns for Regional Social Networks: A Point Process Approach
Volume 21, Issue 3 (2023): Special Issue: Advances in Network Data Science, pp. 446–469
Tiandong Wang ORCID icon link to view author Tiandong Wang details   Sidney I. Resnick  

Authors

 
Placeholder
https://doi.org/10.6339/21-JDS1021
Pub. online: 16 September 2021      Type: Statistical Data Science      Open accessOpen Access

Received
19 April 2021
Accepted
18 August 2021
Published
16 September 2021

Abstract

In this paper, we study macroscopic growth dynamics of social network link formation. Rather than focusing on one particular dataset, we find invariant behavior in regional social networks that are geographically concentrated. Empirical findings suggest that the startup phase of a regional network can be modeled by a self-exciting point process. After the startup phase ends, the growth of the links can be modeled by a non-homogeneous Poisson process with a constant rate across the day but varying rates from day to day, plus a nightly inactive period when local users are expected to be asleep. Conclusions are drawn based on analyzing four different datasets, three of which are regional and a non-regional one is included for contrast.

Supplementary material

 Supplementary Material
We have collected all relevant codes and datasets, and made them available at https://github.com/tw398/NetworkGrowth. This GitHub repository includes: (1) links to the corresponding dataset on KONECT; (2) cleaned Dutch Wikipedia talk data; (3) all relevant codes for data analyses in the paper; and (4) ACF plots which are not presented in the paper.

References

 
Anderson RM, Anderson B, May RM (1992). Infectious Diseases of Humans: Dynamics and Control. Oxford University Press.
 
Backstrom L, Huttenlocher D, Kleinberg J, Lan X (2006). Group formation in large social networks: Membership, growth, and evolution. In: Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ’06, 44–54. Association for Computing Machinery, New York, NY, USA.
 
Bacry E, Mastromatteo I, Muzy JF (2015). Hawkes processes in finance. Market Microstructure and Liquidity, 1(01): 1550005.
 
Bollobás B, Borgs C, Chayes J, Riordan O (2003). Directed scale-free graphs. In: Proceedings of the Fourteenth Annual ACM-SIAM Symposium on Discrete Algorithms. Baltimore. 2003, 132–139. New York. ACM.
 
Daley DJ, Vere-Jones D (2003). An Introduction to the Theory of Point Processes. Vol. I Probability and its Applications (New York). Springer-Verlag, New York, second edition. Elementary theory and methods.
 
Embrechts P, Liniger T, Lin L (2011). Multivariate Hawkes processes: An Application to financial data. Journal of Applied Probability, 48(A): 367–378.
 
Farajtabar M, Du N, Rodriguez MG, Valera I, Zha H, Song L (2014). Shaping social activity by incentivizing users. In: Ghahramani Z, Welling M, Cortes C, Lawrence N, Weinberger KQ (Eds.), Advances in Neural Information Processing Systems, 2474–2482.
 
Goh KI, Barabási AL (2008). Burstiness and memory in complex systems. Europhysics Letters, 81(4): 48002.
 
Hansen NR, Reynaud-Bouret P, Rivoirard V (2015). Lasso and probabilistic inequalities for multivariate point processes. Bernoulli, 21(1): 83–143.
 
Hawkes AG (2018). Hawkes processes and their applications to finance: A review. Quantitative Finance, 18(2): 193–198.
 
Hawkes AG, Oakes D (1974). A cluster process representation of a self-exciting process. Journal of Applied Probability, 11(3): 493–503.
 
Jo HH, Perotti JI, Kaski K, Kertész J (2015). Correlated bursts and the role of memory range. Physical Review E, 92(2): 022814.
 
Kirchner M (2016). Hawkes and INAR (∞) processes. Stochastic Processes and Their Applications, 126(8): 2494–2525.
 
Kirchner M (2017). An estimation procedure for the Hawkes process. Quantitative Finance, 17(4): 571–595.
 
Kirchner M, Bercher A (2018). A nonparametric estimation procedure for the Hawkes process: Comparison with maximum likelihood estimation. Journal of Statistical Computation and Simulation, 88(6): 1106–1116.
 
Kobayashi R, Tideh RL (2016). Time-dependent Hawkes process for predicting retweet dynamics. In: Tenth International AAAI Conference on Web and Social Media.
 
Krapivsky P, Rodgers G, Redner S (2001). Degree distributions of growing networks. Physical Review Letters, 86: 5401–5404.
 
Krapivsky PL, Redner S (2001). Organization of growing random networks. Physical Review E, 63(6): 066123. 1–14.
 
Leskovec J, Kleinberg J, Faloutsos C (2007). Graph evolution: Densification and shrinking diameters. ACM Transactions on Knowledge Discovery from Data, 1(1): 2–es.
 
Mahajan V, Muller E, Bass FM (1990). New product diffusion models in marketing: A review and directions for research. Journal of Marketing, 54(1): 1–26.
 
Martin T, Hofman JM, Sharma A, Anderson A, Watts DJ (2016). Exploring limits to prediction in complex social systems. In: Proceedings of the 25th International Conference on World Wide Web, 683–694. International World Wide Web Conferences Steering Committee.
 
Mishra S, Rizoiu MA, Xie L (2016). Feature driven and point process approaches for popularity prediction. In: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management, 1069–1078. ACM.
 
Mislove A, Koppula HS, Gummadi KP, Druschel P, Bhattacharjee B (2008). Growth of the Flickr social network. In: Proceedings of the First Workshop on Online Social Networks, 25–30. ACM.
 
Mohler G (2013). Modeling and estimation of multi-source clustering in crime and security data. Annals of Applied Statistics, 7(3): 1525–1539.
 
Ogata Y (1988). Statistical models for earthquake occurrences and residual analysis for point processes. Journal of the American Statistical Association, 83(401): 9–27.
 
Porter MD, White G (2012). Self-exciting hurdle models for terrorist activity. Annals of Applied Statistics, 6(1): 106–124.
 
Rizoiu MA, Lee Y, Mishra S, Xie L (2017a). A tutorial on Hawkes processes for events in social media. arXiv preprint: https://arxiv.org/abs/1708.06401.
 
Rizoiu MA, Xie L, Sanner S, Cebrian M, Yu H, Van Hentenryck P (2017b). Expecting to be hip: Hawkes intensity processes for social media popularity. In: Proceedings of the 26th International Conference on World Wide Web, 735–744. International World Wide Web Conferences Steering Committee.
 
Srijith PK, Lukasik M, Bontcheva K, Cohn T (2017). Longitudinal modeling of social media with Hawkes process based on users and networks. In: Diesner J, Ferrari E, Xu G (Eds.), Proceedings of the 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2017, 195–202. ACM.
 
Sun J, Kunegis J (2016). Wiki-talk datasets. Universität Koblenz Landau.
 
Veen A, Schoenberg FP (2008). Estimation of space-time branching process models in seismology using an EM-type algorithm. Journal of the American Statistical Association, 103(482): 614–624.
 
Viswanath B, Mislove A, Cha M, Gummadi KP (2009). On the evolution of user interaction in Facebook. In: Proceedings of the 2nd ACM Workshop on Online Social Networks, 37–42. ACM.
 
Wan P, Wang T, Davis RA, Resnick SI (2017). Fitting the linear preferential attachment model. Electronic Journal of Statistics, 11(2): 3738–3780.
 
Wang T, Resnick SI. A directed preferential attachment model with Poisson measurement. arXiv preprint: https://arxiv.org/abs/2008.07005.
 
Dutch Wikipedia (2019). https://en.wikipedia.org/wiki/Dutch_Wikipedia. Accessed: 2019-10-27.
 
German Wikipedia (2019). https://en.wikipedia.org/wiki/German_Wikipedia. Accessed: 2019-10-27.
 
Wikipedia, Dutch language – Wikipedia, the free encyclopedia (2019). https://en.wikipedia.org/wiki/Dutch_language Accessed: 2019-10-21.
 
Wikipedia (2019). German language – Wikipedia, the free encyclopedia. https://en.wikipedia.org/wiki/German_language. Accessed: 2019-10-21.
 
Zang C, Cui P, Faloutsos C (2016). Beyond sigmoids: The NetTide model for social network growth, and its applications. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015–2024. ACM.
 
Zang C, Cui P, Faloutsos C, Zhu W (2017). Long short memory process: Modeling growth dynamics of microscopic social connectivity. In: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 565–574. ACM.
 
Zhao Q, Erdogdu MA, He HY, Rajaraman A, Seismic JL (2015). A self-exciting point process model for predicting tweet popularity. In: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 1513–1522. ACM.
 
Zhou K, Zha H, Song L (2013). Learning social infectivity in sparse low-rank networks using multi-dimensional Hawkes processes. In: Carvalho CM, Ravikumar P (Eds.), Proceedings of the Sixteenth International Conference on Artificial Intelligence and Statistics, 641–649.

Related articles PDF XML
Related articles PDF XML

Copyright
2023 The Author(s). Published by the School of Statistics and the Center for Applied Statistics, Renmin University of China.
by logo by logo
Open access article under the CC BY license.

Keywords
estimation network growth non-homogeneous Poisson processes point processes self-exciting processes

Metrics
since February 2021
1257

Article info
views

541

PDF
downloads

Export citation

Copy and paste formatted citation
Placeholder

Download citation in file


Share


RSS

Journal of data science

  • Online ISSN: 1683-8602
  • Print ISSN: 1680-743X

About

  • About journal

For contributors

  • Submit
  • OA Policy
  • Become a Peer-reviewer

Contact us

  • JDS@ruc.edu.cn
  • No. 59 Zhongguancun Street, Haidian District Beijing, 100872, P.R. China
Powered by PubliMill  •  Privacy policy