Causal Clustering: Design of Cluster Experiments Under Network Interference: References

cover
31 Jan 2024

Authors:

(1) Davide Viviano, Department of Economics, Harvard University;

(2) Lihua Lei, Graduate School of Business, Stanford University;

(3) Guido Imbens, Graduate School of Business and Department of Economics, Stanford University;

(4) Brian Karrer, FAIR, Meta;

(5) Okke Schrijvers, Meta Central Applied Science;

(6) Liang Shi, Meta Central Applied Science.

Table of Links

Abstract & Introduction

Setup

(When) should you cluster?

Choosing the cluster design

Empirical illustration and numerical studies

Recommendations for practice

References

A) Notation

B) Endogenous peer effects

C) Proofs

References

Alberto Abadie, Susan Athey, Guido W Imbens, and Jeffrey Wooldridge. When should you adjust standard errors for clustering? Technical report, National Bureau of Economic Research, 2017.

Vivi Alatas, Abhijit Banerjee, Rema Hanna, Benjamin A Olken, and Julia Tobias. Targeting the poor: evidence from a field experiment in indonesia. American Economic Review, 102 (4):1206–1240, 2012.

Peter M Aronow and Cyrus Samii. Estimating average causal effects under general interference, with application to a social network experiment. The Annals of Applied Statistics, 11(4):1912–1947, 2017.

Susan Athey, Dean Eckles, and Guido W Imbens. Exact p-values for network interference. Journal of the American Statistical Association, 113(521):230–240, 2018.

Sarah Baird, J Aislinn Bohren, Craig McIntosh, and Berk Ozler. Optimal design of experi- ¨ ments in the presence of interference. Review of Economics and Statistics, 100(5):844–860, 2018.

G Basse and A Feller. Analyzing multilevel experiments in the presence of peer effects. arXiv preprint arXiv, 1608, 2016.

Guillaume W Basse and Edoardo M Airoldi. Limitations of design-based causal inference and a/b testing under arbitrary and network interference. Sociological Methodology, 48(1): 136–151, 2018a.

Guillaume W Basse and Edoardo M Airoldi. Model-assisted design of experiments in the presence of network-correlated outcomes. Biometrika, 105(4):849–858, 2018b.

Vincent D Blondel, Jean-Loup Guillaume, Renaud Lambiotte, and Etienne Lefebvre. Fast unfolding of communities in large networks. Journal of statistical mechanics: theory and experiment, 2008(10):P10008, 2008.

Paul S Bradley, Kristin P Bennett, and Ayhan Demiriz. Constrained k-means clustering. Microsoft Research, Redmond, 20(0):0, 2000.

Yann Bramoull´e, Habiba Djebbari, and Bernard Fortin. Identification of peer effects through social networks. Journal of econometrics, 150(1):41–55, 2009.

Emily Breza, Arun G Chandrasekhar, Tyler H McCormick, and Mengjie Pan. Using aggregated relational data to feasibly identify network structure without network data. American Economic Review, 110(8):2454–2484, 2020.

Jing Cai, Alain De Janvry, and Elisabeth Sadoulet. Social networks and the decision to insure. American Economic Journal: Applied Economics, 7(2):81–108, 2015.

Aureo De Paula, Seth Richards-Shubik, and Elie Tamer. Identifying preferences in networks ´ with bounded degree. Econometrica, 86(1):263–288, 2018.

Dean Eckles, Brian Karrer, and Johan Ugander. Design and analysis of experiments in networks: Reducing bias from interference. Journal of Causal Inference, 5(1), 2017.

Dennis Egger, Johannes Haushofer, Edward Miguel, Paul Niehaus, and Michael Walker. General equilibrium effects of cash transfers: experimental evidence from kenya. Econometrica, 90(6):2603–2643, 2022.

Stefan Faridani and Paul Niehaus. Rate-optimal linear estimation of average global effects. arXiv preprint arXiv:2209.14181, 2022.

Paul Goldsmith-Pinkham and Guido W Imbens. Social networks and the identification of peer effects. Journal of Business & Economic Statistics, 31(3):253–264, 2013.

Keisuke Hirano and Jack R Porter. Asymptotics for statistical treatment rules. Econometrica, 77(5):1683–1701, 2009.

David Holtz, Ruben Lobel, Inessa Liskovich, and Sinan Aral. Reducing interference bias in online marketplace pricing experiments. arXiv preprint arXiv:2004.12489, 2020.

Daniel Hong, Hyunwoo Lee, and Alex Wei. Optimal solutions and ranks in the max-cut sdp. arXiv preprint arXiv:2109.02238, 2021.

Michael G Hudgens and M Elizabeth Halloran. Toward causal inference with interference. Journal of the American Statistical Association, 103(482):832–842, 2008.

Ravi Jagadeesan, Natesh S Pillai, and Alexander Volfovsky. Designs for estimating the treatment effect in networks with interference. 2020.

Igor Kabiljo, Brian Karrer, Mayank Pundir, Sergey Pupyrev, Alon Shalita, Alessandro Presta, and Yaroslav Akhremtsev. Social hash partitioner: a scalable distributed hypergraph partitioner. arXiv preprint arXiv:1707.06665, 2017.

Hyunseung Kang and Guido Imbens. Peer encouragement designs in causal inference with partial interference and identification of local average network effects. arXiv preprint arXiv:1609.04464, 2016.

Brian Karrer, Liang Shi, Monica Bhole, Matt Goldman, Tyrone Palmer, Charlie Gelman, Mikael Konutgan, and Feng Sun. Network experimentation at scale. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, pages 3106– 3116, 2021.

Toru Kitagawa and Guanyi Wang. Who should get vaccinated? individualized allocation of vaccines over sir network. Journal of Econometrics, 2021.

Lihua Lei. Unified ℓ2→∞ eigenspace perturbation theory for symmetric random matrices. arXiv preprint arXiv:1909.04798, 2019.

Lihua Lei, Xiaodong Li, and Xingmei Lou. Consistency of spectral clustering on hierarchical stochastic block models. arXiv preprint arXiv:2004.14531, 2020.

Michael P Leung. Treatment and spillover effects under network interference. Review of Economics and Statistics, 102(2):368–380, 2020.

Michael P Leung. Causal inference under approximate neighborhood interference. Econometrica, 90(1):267–293, 2022.

Michael P Leung. Network cluster-robust inference. Econometrica, 91(2):641–667, 2023.

Shuangning Li and Stefan Wager. Random graph asymptotics for treatment effect estimation under network interference. The Annals of Statistics, 50(4):2334–2358, 2022.

Tianxi Li, Lihua Lei, Sharmodeep Bhattacharyya, Koen Van den Berge, Purnamrita Sarkar, Peter J Bickel, and Elizaveta Levina. Hierarchical community detection by recursive partitioning. Journal of the American Statistical Association, 117(538):951–968, 2022.

Shuyang Ling and Thomas Strohmer. Certifying global optimality of graph cuts via semidefinite relaxation: A performance guarantee for spectral clustering. Foundations of Computational Mathematics, 20(3):367–421, 2020.

Elena Manresa. Estimating the structure of social interactions using panel data. Unpublished Manuscript. CEMFI, Madrid, 2013.

Charles F Manski. Identification of endogenous social effects: The reflection problem. The review of economic studies, 60(3):531–542, 1993.

Charles F Manski. Identification of treatment response with social interactions. The Econometrics Journal, 16(1):S1–S23, 2013.

Karthik Muralidharan and Paul Niehaus. Experimentation at scale. Journal of Economic Perspectives, 31(4):103–24, 2017.

Mark EJ Newman. Community detection and graph partitioning. Europhysics Letters, 103 (2):28003, 2013a.

Mark EJ Newman. Spectral methods for community detection and graph partitioning. Physical Review E, 88(4):042822, 2013b.

Elizabeth L Ogburn, Oleg Sofrygin, Ivan Diaz, and Mark J van der Laan. Causal inference for social network data. arXiv preprint arXiv:1705.08527, 2017.

Jean Pouget-Abadie. Dealing with Interference on Experimentation Platforms. PhD thesis, 2018.

Fredrik S¨avje, Peter Aronow, and Michael Hudgens. Average treatment effects in the presence of unknown interference. Annals of statistics, 49(2):673, 2021.

Jianbo Shi and Jitendra Malik. Normalized cuts and image segmentation. IEEE Transactions on pattern analysis and machine intelligence, 22(8):888–905, 2000.

Betsy Sinclair, Margaret McConnell, and Donald P Green. Detecting spillover effects: Design and analysis of multilevel experiments. American Journal of Political Science, 56(4):1055– 1069, 2012.

Sean J Taylor and Dean Eckles. Randomized experiments to detect and estimate social influence in networks. In Complex Spreading Phenomena in Social Systems, pages 289– 322. Springer, 2018.

Johan Ugander, Brian Karrer, Lars Backstrom, and Jon Kleinberg. Graph cluster randomization: Network exposure to multiple universes. In Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 329–337. ACM, 2013.

Davide Viviano. Experimental design under network interference. arXiv preprint arXiv:2003.08421, 2020a.

Davide Viviano. Policy choice in experiments with unknown interference. arXiv preprint arXiv:2011.08174, 2020b.

Ulrike Von Luxburg. A tutorial on spectral clustering. Statistics and computing, 17:395–416, 2007.

Jeffrey M Wooldridge. Cluster-sample methods in applied econometrics. American Economic Review, 93(2):133–138, 2003.

This paper is available on arxiv under CC 1.0 license.