<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">JDS</journal-id>
<journal-title-group><journal-title>Journal of Data Science</journal-title></journal-title-group>
<issn pub-type="epub">1683-8602</issn><issn pub-type="ppub">1680-743X</issn><issn-l>1680-743X</issn-l>
<publisher>
<publisher-name>School of Statistics, Renmin University of China</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">JDS1112</article-id>
<article-id pub-id-type="doi">10.6339/23-JDS1112</article-id>
<article-categories><subj-group subj-group-type="heading">
<subject>Statistical Data Science</subject></subj-group></article-categories>
<title-group>
<article-title>Network A/B Testing: Nonparametric Statistical Significance Test Based on Cluster-Level Permutation</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Shang</surname><given-names>Hongwei</given-names></name><email xlink:href="mailto:hongwei.shang@walmart.com">hongwei.shang@walmart.com</email><xref ref-type="aff" rid="j_jds1112_aff_001">1</xref><xref ref-type="corresp" rid="cor1">∗</xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Shi</surname><given-names>Xiaolin</given-names></name><xref ref-type="aff" rid="j_jds1112_aff_002">2</xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Jiang</surname><given-names>Bai</given-names></name><xref ref-type="aff" rid="j_jds1112_aff_003">3</xref>
</contrib>
<aff id="j_jds1112_aff_001"><label>1</label><institution>Walmart Global Tech</institution>, <country>Search Dept, Sunnyvale, CA, US</country></aff>
<aff id="j_jds1112_aff_002"><label>2</label><institution>Snap Inc</institution>, <country>Santa Monica, CA, US</country></aff>
<aff id="j_jds1112_aff_003"><label>3</label><institution>Citadel Securities</institution>, <country>Chicago, IL, US</country></aff>
</contrib-group>
<author-notes>
<corresp id="cor1"><label>∗</label>Corresponding author. Email: <ext-link ext-link-type="uri" xlink:href="mailto:hongwei.shang@walmart.com">hongwei.shang@walmart.com</ext-link>.</corresp>
</author-notes>
<pub-date pub-type="ppub"><year>2023</year></pub-date><pub-date pub-type="epub"><day>25</day><month>7</month><year>2023</year></pub-date><volume>21</volume><issue>3</issue><fpage>523</fpage><lpage>537</lpage><supplementary-material id="S1" content-type="archive" xlink:href="jds1112_s001.zip" mimetype="application" mime-subtype="x-zip-compressed">
<caption>
<title>Supplementary Material</title>
<p>The zip supplementary material file contains the Python scripts for generating graph data, computing ATE estimators, estimating p-value via permutation tests, and generating figures in this paper.</p>
</caption>
</supplementary-material><history><date date-type="received"><day>14</day><month>7</month><year>2023</year></date><date date-type="accepted"><day>14</day><month>7</month><year>2023</year></date></history>
<permissions><copyright-statement>2023 The Author(s). Published by the School of Statistics and the Center for Applied Statistics, Renmin University of China.</copyright-statement><copyright-year>2023</copyright-year>
<license license-type="open-access" xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>Open access article under the <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">CC BY</ext-link> license.</license-p></license></permissions>
<abstract>
<p>A/B testing is widely used for comparing two versions of a product and evaluating new proposed product features. It is of great importance for decision-making and has been applied as a golden standard in the IT industry. It is essentially a form of two-sample statistical hypothesis testing. Average treatment effect (ATE) and the corresponding p-value can be obtained under certain assumptions. One key assumption in traditional A/B testing is the <italic>stable-unit-treatment-value assumption</italic> (SUTVA): there is no interference among different units. It means that the observation on one unit is unaffected by the particular assignment of treatments to the other units. Nonetheless, interference is very common in social network settings where people communicate and spread information to their neighbors. Therefore, the SUTVA assumption is violated. Analysis ignoring this network effect will lead to biased estimation of ATE. Most existing works focus mainly on the design of experiment and data analysis in order to produce estimators with good performance in regards to bias and variance. Little attention has been paid to the calculation of p-value. We work on the calculation of p-value for the ATE estimator in network A/B tests. After a brief review of existing research methods on design of experiment based on <italic>graph cluster randomization</italic> and different ATE estimation methods, we propose a permutation method for calculating p-value based on permutation test at the cluster level. The effectiveness of the method against that based on individual-level permutation is validated in a simulation study mimicking realistic settings.</p>
</abstract>
<kwd-group>
<label>Keywords</label>
<kwd>design of experiments</kwd>
<kwd>graph cluster randomization</kwd>
<kwd>p-value</kwd>
</kwd-group>
</article-meta>
</front>
<back>
<ref-list id="j_jds1112_reflist_001">
<title>References</title>
<ref id="j_jds1112_ref_001">
<mixed-citation publication-type="chapter"> <string-name><surname>Aronow</surname> <given-names>PM</given-names></string-name>, <string-name><surname>Samii</surname> <given-names>C</given-names></string-name> (<year>2012</year>). <chapter-title>Estimating average causal effects under general interference</chapter-title>. In: <source><italic>Summer Meeting of the Society for Political Methodology</italic></source>, <comment>University of North Carolina, Chapel Hill, July</comment>, <fpage>19</fpage>–<lpage>21</lpage>, <publisher-name>Citeseer</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1112_ref_002">
<mixed-citation publication-type="chapter"> <string-name><surname>Backstrom</surname> <given-names>L</given-names></string-name>, <string-name><surname>Kleinberg</surname> <given-names>J</given-names></string-name> (<year>2011</year>). <chapter-title>Network bucket testing</chapter-title>. In: <source><italic>Proceedings of the 20th International Conference on World Wide Web, WWW’11</italic></source>, <fpage>615</fpage>–<lpage>624</lpage>. <publisher-name>ACM</publisher-name>, <publisher-loc>New York, NY, USA</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1112_ref_003">
<mixed-citation publication-type="other"> <string-name><surname>Eckles</surname> <given-names>D</given-names></string-name>, <string-name><surname>Karrer</surname> <given-names>B</given-names></string-name>, <string-name><surname>Ugander</surname> <given-names>J</given-names></string-name> (2014). Design and analysis of experiments in networks: Reducing bias from interference. arXiv preprint: <uri>https://arxiv.org/abs/1404.7530</uri>.</mixed-citation>
</ref>
<ref id="j_jds1112_ref_004">
<mixed-citation publication-type="chapter"> <string-name><surname>Gui</surname> <given-names>H</given-names></string-name>, <string-name><surname>Xu</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Bhasin</surname> <given-names>A</given-names></string-name>, <string-name><surname>Han</surname> <given-names>J</given-names></string-name> (<year>2015</year>). <chapter-title>Network A/B testing: From sampling to estimation</chapter-title>. In: <source><italic>Proceedings of the 24th International Conference on World Wide Web, WWW’15</italic></source>, <fpage>399</fpage>–<lpage>409</lpage>. <publisher-name>ACM</publisher-name>, <publisher-loc>New York, NY, USA</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1112_ref_005">
<mixed-citation publication-type="chapter"> <string-name><surname>Gupta</surname> <given-names>A</given-names></string-name>, <string-name><surname>Krauthgamer</surname> <given-names>R</given-names></string-name>, <string-name><surname>Lee</surname> <given-names>JR</given-names></string-name> (<year>2003</year>). <chapter-title>Bounded geometries, fractals, and low-distortion embeddings</chapter-title>. In: <source><italic>Foundations of Computer Science, 2003. Proceedings. 44th Annual IEEE Symposium on</italic></source>, <fpage>534</fpage>–<lpage>543</lpage>. <publisher-name>IEEE</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1112_ref_006">
<mixed-citation publication-type="book"> <string-name><surname>Jiang</surname> <given-names>B</given-names></string-name>, <string-name><surname>Shi</surname> <given-names>X</given-names></string-name>, <string-name><surname>Shang</surname> <given-names>H</given-names></string-name>, <string-name><surname>Geng</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Glass</surname> <given-names>A</given-names></string-name> (<year>2016</year>). <source><italic>A Framework for Network A/B Test</italic></source>. <comment>arXiv preprint: <uri>https://arxiv.org/abs/1610.07670</uri></comment>.</mixed-citation>
</ref>
<ref id="j_jds1112_ref_007">
<mixed-citation publication-type="chapter"> <string-name><surname>Karrer</surname> <given-names>B</given-names></string-name>, <string-name><surname>Shi</surname> <given-names>L</given-names></string-name>, <string-name><surname>Bhole</surname> <given-names>M</given-names></string-name>, <string-name><surname>Goldman</surname> <given-names>M</given-names></string-name>, <string-name><surname>Palmer</surname> <given-names>T</given-names></string-name>, <string-name><surname>Gelman</surname> <given-names>C</given-names></string-name>, <etal>et al.</etal> (<year>2021</year>). <chapter-title>Network experimentation at scale</chapter-title>. In: <source><italic>Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery &amp; Data Mining</italic></source>, <fpage>3106</fpage>–<lpage>3116</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1112_ref_008">
<mixed-citation publication-type="journal"> <string-name><surname>Karypis</surname> <given-names>G</given-names></string-name>, <string-name><surname>Kumar</surname> <given-names>V</given-names></string-name> (<year>1998</year>). <article-title>Multilevel k-way partitioning scheme for irregular graphs</article-title>. <source><italic>Journal of Parallel and Distributed Computing</italic></source>, <volume>48</volume>(<issue>1</issue>): <fpage>96</fpage>–<lpage>129</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1006/jpdc.1997.1404" xlink:type="simple">https://doi.org/10.1006/jpdc.1997.1404</ext-link></mixed-citation>
</ref>
<ref id="j_jds1112_ref_009">
<mixed-citation publication-type="chapter"> <string-name><surname>Katzir</surname> <given-names>L</given-names></string-name>, <string-name><surname>Liberty</surname> <given-names>E</given-names></string-name>, <string-name><surname>Somekh</surname> <given-names>O</given-names></string-name> (<year>2012</year>). <chapter-title>Framework and algorithms for network bucket testing</chapter-title>. In: <source><italic>Proceedings of the 21st International Conference on World Wide Web, WWW’12</italic></source>, <fpage>1029</fpage>–<lpage>1036</lpage>. <publisher-name>ACM</publisher-name>, <publisher-loc>New York, NY, USA</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1112_ref_010">
<mixed-citation publication-type="chapter"> <string-name><surname>Kohavi</surname> <given-names>R</given-names></string-name>, <string-name><surname>Deng</surname> <given-names>A</given-names></string-name>, <string-name><surname>Frasca</surname> <given-names>B</given-names></string-name>, <string-name><surname>Longbotham</surname> <given-names>R</given-names></string-name>, <string-name><surname>Walker</surname> <given-names>T</given-names></string-name>, <string-name><surname>Xu</surname> <given-names>Y</given-names></string-name> (<year>2012</year>). <chapter-title>Trustworthy online controlled experiments: Five puzzling outcomes explained</chapter-title>. In: <source><italic>Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining</italic></source>, <fpage>786</fpage>–<lpage>794</lpage>. <publisher-name>ACM</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1112_ref_011">
<mixed-citation publication-type="chapter"> <string-name><surname>Kohavi</surname> <given-names>R</given-names></string-name>, <string-name><surname>Deng</surname> <given-names>A</given-names></string-name>, <string-name><surname>Frasca</surname> <given-names>B</given-names></string-name>, <string-name><surname>Walker</surname> <given-names>T</given-names></string-name>, <string-name><surname>Xu</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Pohlmann</surname> <given-names>N</given-names></string-name> (<year>2013</year>). <chapter-title>Online controlled experiments at large scale</chapter-title>. In: <source><italic>Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining</italic></source>, <fpage>1168</fpage>–<lpage>1176</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1112_ref_012">
<mixed-citation publication-type="chapter"> <string-name><surname>Kohavi</surname> <given-names>R</given-names></string-name>, <string-name><surname>Deng</surname> <given-names>A</given-names></string-name>, <string-name><surname>Longbotham</surname> <given-names>R</given-names></string-name>, <string-name><surname>Xu</surname> <given-names>Y</given-names></string-name> (<year>2014</year>). <chapter-title>Seven rules of thumb for web site experimenters</chapter-title>. In: <source><italic>Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD’14</italic></source>, <fpage>1857</fpage>–<lpage>1866</lpage>. <publisher-name>ACM</publisher-name>, <publisher-loc>New York, NY, USA</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1112_ref_013">
<mixed-citation publication-type="journal"> <string-name><surname>Kohavi</surname> <given-names>R</given-names></string-name>, <string-name><surname>Longbotham</surname> <given-names>R</given-names></string-name>, <string-name><surname>Walker</surname> <given-names>T</given-names></string-name> (<year>2010</year>). <article-title>Online experiments: Practical lessons</article-title>. <source><italic>Computer</italic></source>, <volume>43</volume>(<issue>9</issue>): <fpage>82</fpage>–<lpage>85</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1109/MC.2010.264" xlink:type="simple">https://doi.org/10.1109/MC.2010.264</ext-link></mixed-citation>
</ref>
<ref id="j_jds1112_ref_014">
<mixed-citation publication-type="book"> <string-name><surname>Kohavi</surname> <given-names>R</given-names></string-name>, <string-name><surname>Tang</surname> <given-names>D</given-names></string-name>, <string-name><surname>Xu</surname> <given-names>Y</given-names></string-name> (<year>2020</year>). <source><italic>Trustworthy Online Controlled Experiments: A Practical Guide to A/B Testing</italic></source>. <publisher-name>Cambridge University Press</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1112_ref_015">
<mixed-citation publication-type="chapter"> <string-name><surname>Liu</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Zhou</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Li</surname> <given-names>P</given-names></string-name>, <string-name><surname>Hu</surname> <given-names>F</given-names></string-name> (<year>2022</year>). <chapter-title>Adaptive A/B test on networks with cluster structures</chapter-title>. In: <source><italic>International Conference on Artificial Intelligence and Statistics</italic></source>, <fpage>10836</fpage>–<lpage>10851</lpage>. <publisher-name>PMLR</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1112_ref_016">
<mixed-citation publication-type="journal"> <string-name><surname>Maris</surname> <given-names>E</given-names></string-name>, <string-name><surname>Oostenveld</surname> <given-names>R</given-names></string-name> (<year>2007</year>). <article-title>Nonparametric statistical testing of EEG-and MEG-data</article-title>. <source><italic>Journal of Neuroscience Methods</italic></source>, <volume>164</volume>(<issue>1</issue>): <fpage>177</fpage>–<lpage>190</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1016/j.jneumeth.2007.03.024" xlink:type="simple">https://doi.org/10.1016/j.jneumeth.2007.03.024</ext-link></mixed-citation>
</ref>
<ref id="j_jds1112_ref_017">
<mixed-citation publication-type="chapter"> <string-name><surname>Nishimura</surname> <given-names>J</given-names></string-name>, <string-name><surname>Ugander</surname> <given-names>J</given-names></string-name> (<year>2013</year>). <chapter-title>Restreaming graph partitioning: simple versatile algorithms for advanced balancing</chapter-title>. In: <source><italic>Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining</italic></source>, <fpage>1106</fpage>–<lpage>1114</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1112_ref_018">
<mixed-citation publication-type="journal"> <string-name><surname>Rubin</surname> <given-names>DB</given-names></string-name> (<year>1986</year>). <article-title>Comment: Which ifs have causal answers?</article-title> <source><italic>Journal of the American Statistical Association</italic></source>, <volume>81</volume>(<issue>396</issue>): <fpage>961</fpage>–<lpage>962</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1112_ref_019">
<mixed-citation publication-type="chapter"> <string-name><surname>Saveski</surname> <given-names>M</given-names></string-name>, <string-name><surname>Pouget-Abadie</surname> <given-names>J</given-names></string-name>, <string-name><surname>Saint-Jacques</surname> <given-names>G</given-names></string-name>, <string-name><surname>Duan</surname> <given-names>W</given-names></string-name>, <string-name><surname>Ghosh</surname> <given-names>S</given-names></string-name>, <string-name><surname>Xu</surname> <given-names>Y</given-names></string-name>, <etal>et al.</etal> (<year>2017</year>). <chapter-title>Detecting network effects: Randomizing over randomized experiments</chapter-title>. In: <source><italic>Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining</italic></source>, <fpage>1027</fpage>–<lpage>1035</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1112_ref_020">
<mixed-citation publication-type="chapter"> <string-name><surname>Ugander</surname> <given-names>J</given-names></string-name>, <string-name><surname>Backstrom</surname> <given-names>L</given-names></string-name> (<year>2013</year>). <chapter-title>Balanced label propagation for partitioning massive graphs</chapter-title>. In: <source><italic>Proceedings of the Sixth ACM International Conference on Web Search and Data Mining</italic></source>, <fpage>507</fpage>–<lpage>516</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1112_ref_021">
<mixed-citation publication-type="chapter"> <string-name><surname>Ugander</surname> <given-names>J</given-names></string-name>, <string-name><surname>Karrer</surname> <given-names>B</given-names></string-name>, <string-name><surname>Backstrom</surname> <given-names>L</given-names></string-name>, <string-name><surname>Kleinberg</surname> <given-names>J</given-names></string-name> (<year>2013</year>). <chapter-title>Graph cluster randomization: Network exposure to multiple universes</chapter-title>. In: <source><italic>Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD’13</italic></source>, <fpage>329</fpage>–<lpage>337</lpage>. <publisher-name>ACM</publisher-name>, <publisher-loc>New York, NY, USA</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1112_ref_022">
<mixed-citation publication-type="journal"> <string-name><surname>Ugander</surname> <given-names>J</given-names></string-name>, <string-name><surname>Yin</surname> <given-names>H</given-names></string-name> (<year>2023</year>). <article-title>Randomized graph cluster randomization</article-title>. <source><italic>Journal of Causal Inference</italic></source>, <volume>11</volume>(<issue>1</issue>): <elocation-id>20220014</elocation-id>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1515/jci-2022-0014" xlink:type="simple">https://doi.org/10.1515/jci-2022-0014</ext-link></mixed-citation>
</ref>
<ref id="j_jds1112_ref_023">
<mixed-citation publication-type="journal"> <string-name><surname>Watts</surname> <given-names>DJ</given-names></string-name>, <string-name><surname>Strogatz</surname> <given-names>SH</given-names></string-name> (<year>1998</year>). <article-title>Collective dynamics of ‘small-world’ networks</article-title>. <source><italic>Nature</italic></source>, <volume>393</volume>(<issue>6684</issue>): <fpage>440</fpage>–<lpage>442</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1038/30918" xlink:type="simple">https://doi.org/10.1038/30918</ext-link></mixed-citation>
</ref>
</ref-list>
</back>
</article>
