<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">JDS</journal-id>
<journal-title-group><journal-title>Journal of Data Science</journal-title></journal-title-group>
<issn pub-type="epub">1683-8602</issn><issn pub-type="ppub">1680-743X</issn><issn-l>1680-743X</issn-l>
<publisher>
<publisher-name>School of Statistics, Renmin University of China</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">JDS1028</article-id>
<article-id pub-id-type="doi">10.6339/21-JDS1028</article-id>
<article-categories><subj-group subj-group-type="heading">
<subject>Statistical Data Science</subject></subj-group></article-categories>
<title-group>
<article-title>Integrative Clustering Analysis with Application in Multi-Source Gene Expression Data</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Yang</surname><given-names>Liuqing</given-names></name><xref ref-type="aff" rid="j_jds1028_aff_001">1</xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Pan</surname><given-names>Qing</given-names></name><xref ref-type="aff" rid="j_jds1028_aff_001">1</xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Zhao</surname><given-names>Yunpeng</given-names></name><email xlink:href="mailto:yunpeng.zhao@asu.edu">yunpeng.zhao@asu.edu</email><xref ref-type="aff" rid="j_jds1028_aff_002">2</xref><xref ref-type="corresp" rid="cor1">∗</xref>
</contrib>
<aff id="j_jds1028_aff_001"><label>1</label>Department of Statistics, <institution>George Washington University</institution>, Washington, D.C., <country>U.S.A.</country></aff>
<aff id="j_jds1028_aff_002"><label>2</label>School of Mathematical and Natural Sciences, <institution>Arizona State University</institution>, Tempe, Arizona, <country>U.S.A.</country></aff>
</contrib-group>
<author-notes>
<corresp id="cor1"><label>∗</label>Corresponding author. Email: <ext-link ext-link-type="uri" xlink:href="mailto:yunpeng.zhao@asu.edu">yunpeng.zhao@asu.edu</ext-link>.</corresp>
</author-notes>
<pub-date pub-type="ppub"><year>2022</year></pub-date><pub-date pub-type="epub"><day>9</day><month>11</month><year>2021</year></pub-date><volume>20</volume><issue>1</issue><fpage>14</fpage><lpage>33</lpage><supplementary-material id="S1" content-type="archive" xlink:href="jds1028_s001.zip" mimetype="application" mime-subtype="x-zip-compressed">
<caption>
<title>Supplementary Material</title>
<p>Code for the integrative analysis and the data used in the real data analysis are available at <ext-link ext-link-type="uri" xlink:href="https://github.com/yangliuqing1992/Integrative_clustering">https://github.com/yangliuqing1992/Integrative_clustering</ext-link>.</p>
</caption>
</supplementary-material><history><date date-type="received"><day>6</day><month>4</month><year>2021</year></date><date date-type="accepted"><day>12</day><month>10</month><year>2021</year></date></history>
<permissions><copyright-statement>2022 The Author(s). Published by the School of Statistics and the Center for Applied Statistics, Renmin University of China.</copyright-statement><copyright-year>2022</copyright-year>
<license license-type="open-access" xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>Open access article under the <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">CC BY</ext-link> license.</license-p></license></permissions>
<abstract>
<p>In omics studies, different sources of information about the same set of genes are often available. When the group structure (e.g., gene pathways) within the genes are of interests, we combine the normal hierarchical model with the stochastic block model, through an integrative clustering framework, to model gene expression and gene networks jointly. The integrative framework provides higher accuracy in extensive simulation studies when one or both of the data sources contain noises or when different data sources provide complementary information. An empirical guideline in the choice between integrative versus separate clustering models is proposed. The integrative clustering method is illustrated on the mouse embryo single cell RNAseq and bulk cell microarray data, which identified not only the gene sets shared by both data sources but also the gene sets unique in one data source.</p>
</abstract>
<kwd-group>
<label>Keywords</label>
<kwd>EM algorithm</kwd>
<kwd>empirical guidelines</kwd>
<kwd>microarray data</kwd>
<kwd>normal hierarchical model</kwd>
<kwd>single cell RNAseq</kwd>
<kwd>stochastic block model</kwd>
</kwd-group>
</article-meta>
</front>
<back>
<ref-list id="j_jds1028_reflist_001">
<title>References</title>
<ref id="j_jds1028_ref_001">
<mixed-citation publication-type="journal"> <string-name><surname>Abbe</surname> <given-names>E</given-names></string-name> (<year>2017</year>). <article-title>Community detection and stochastic block models: recent developments</article-title>. <source>The Journal of Machine Learning Research</source>, <volume>18</volume>(<issue>177</issue>): <fpage>1</fpage>–<lpage>86</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1028_ref_002">
<mixed-citation publication-type="journal"> <string-name><surname>Amini</surname> <given-names>AA</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>A</given-names></string-name>, <string-name><surname>Bickel</surname> <given-names>PJ</given-names></string-name>, <string-name><surname>Levina</surname> <given-names>E</given-names></string-name> (<year>2013</year>). <article-title>Pseudo-likelihood methods for community detection in large sparse networks</article-title>. <source>The Annals of Statistics</source>, <volume>41</volume>(<issue>4</issue>): <fpage>2097</fpage>–<lpage>2122</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1028_ref_003">
<mixed-citation publication-type="journal"> <string-name><surname>Andäng</surname> <given-names>M</given-names></string-name>, <string-name><surname>Moliner</surname> <given-names>A</given-names></string-name>, <string-name><surname>Doege</surname> <given-names>CA</given-names></string-name>, <string-name><surname>Ibañez</surname> <given-names>CF</given-names></string-name>, <string-name><surname>Ernfors</surname> <given-names>P</given-names></string-name> (<year>2008</year>). <article-title>Optimized mouse ES cell culture system by suspension growth in a fully defined medium</article-title>. <source>Nature Protocols</source>, <volume>3</volume>(<issue>6</issue>): <fpage>1013</fpage>–<lpage>1017</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1028_ref_004">
<mixed-citation publication-type="journal"> <string-name><surname>Bickel</surname> <given-names>PJ</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>A</given-names></string-name> (<year>2009</year>). <article-title>A nonparametric view of network models and Newman–Girvan and other modularities</article-title>. <source>Proceedings of the National Academy of Sciences</source>, <volume>106</volume>(<issue>50</issue>): <fpage>21068</fpage>–<lpage>21073</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1028_ref_005">
<mixed-citation publication-type="journal"> <string-name><surname>Di</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Schafer</surname> <given-names>DW</given-names></string-name>, <string-name><surname>Cumbie</surname> <given-names>JS</given-names></string-name>, <string-name><surname>Chang</surname> <given-names>JH</given-names></string-name> (<year>2011</year>). <article-title>The NBP negative binomial model for assessing differential gene expression from RNA-Seq</article-title>. <source>Statistical Applications in Genetics and Molecular Biology</source>, <volume>10</volume>(<issue>1</issue>): <fpage>1</fpage>–<lpage>28</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1028_ref_006">
<mixed-citation publication-type="book"> <string-name><surname>Fletcher</surname> <given-names>R</given-names></string-name> (<year>2013</year>). <source>Practical Methods of Optimization</source>. <publisher-name>John Wiley &amp; Sons</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1028_ref_007">
<mixed-citation publication-type="journal"> <string-name><surname>Forcato</surname> <given-names>M</given-names></string-name>, <string-name><surname>Romano</surname> <given-names>O</given-names></string-name>, <string-name><surname>Bicciato</surname> <given-names>S</given-names></string-name> (<year>2021</year>). <article-title>Computational methods for the integrative analysis of single-cell data</article-title>. <source>Briefings in Bioinformatics</source>, <volume>22</volume>(<issue>3</issue>): <fpage>1</fpage>–<lpage>10</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1028_ref_008">
<mixed-citation publication-type="journal"> <string-name><surname>Goldenberg</surname> <given-names>A</given-names></string-name>, <string-name><surname>Zheng</surname> <given-names>AX</given-names></string-name>, <string-name><surname>Fienberg</surname> <given-names>SE</given-names></string-name>, <string-name><surname>Airoldi</surname> <given-names>EM</given-names></string-name> (<year>2010</year>). <article-title>A survey of statistical network models</article-title>. <source>Foundations and Trends in Machine Learning</source>, <volume>2</volume>(<issue>2</issue>): <fpage>129</fpage>–<lpage>233</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1028_ref_009">
<mixed-citation publication-type="journal"> <string-name><surname>Hoadley</surname> <given-names>KA</given-names></string-name>, <string-name><surname>Yau</surname> <given-names>C</given-names></string-name>, <string-name><surname>Wolf</surname> <given-names>DM</given-names></string-name>, <string-name><surname>Cherniack</surname> <given-names>AD</given-names></string-name>, <string-name><surname>Tamborero</surname> <given-names>D</given-names></string-name>, <string-name><surname>Ng</surname> <given-names>S</given-names></string-name>, <etal>et al.</etal> (<year>2014</year>). <article-title>Multiplatform analysis of 12 cancer types reveals molecular classification within and across tissues of origin</article-title>. <source>Cell</source>, <volume>158</volume>(<issue>4</issue>): <fpage>929</fpage>–<lpage>944</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1028_ref_010">
<mixed-citation publication-type="journal"> <string-name><surname>Holland</surname> <given-names>PW</given-names></string-name>, <string-name><surname>Laskey</surname> <given-names>KB</given-names></string-name>, <string-name><surname>Leinhardt</surname> <given-names>S</given-names></string-name> (<year>1983</year>). <article-title>Stochastic blockmodels: First steps</article-title>. <source>Social Networks</source>, <volume>5</volume>(<issue>2</issue>): <fpage>109</fpage>–<lpage>137</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1028_ref_011">
<mixed-citation publication-type="journal"> <string-name><surname>Hu</surname> <given-names>J</given-names></string-name>, <string-name><surname>Qin</surname> <given-names>H</given-names></string-name>, <string-name><surname>Yan</surname> <given-names>T</given-names></string-name>, <string-name><surname>Zhao</surname> <given-names>Y</given-names></string-name> (<year>2020</year>). <article-title>Corrected bayesian information criterion for stochastic block models</article-title>. <source>Journal of the American Statistical Association</source>, <volume>115</volume>(<issue>532</issue>): <fpage>1771</fpage>–<lpage>1783</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1028_ref_012">
<mixed-citation publication-type="journal"> <string-name><surname>Islam</surname> <given-names>S</given-names></string-name>, <string-name><surname>Kjällquist</surname> <given-names>U</given-names></string-name>, <string-name><surname>Moliner</surname> <given-names>A</given-names></string-name>, <string-name><surname>Zajac</surname> <given-names>P</given-names></string-name>, <string-name><surname>Fan</surname> <given-names>JB</given-names></string-name>, <string-name><surname>Lönnerberg</surname> <given-names>P</given-names></string-name>, <etal>et al.</etal> (<year>2011</year>). <article-title>Characterization of the single-cell transcriptional landscape by highly multiplex RNA-seq</article-title>. <source>Genome Research</source>, <volume>21</volume>(<issue>7</issue>): <fpage>1160</fpage>–<lpage>1167</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1028_ref_013">
<mixed-citation publication-type="journal"> <string-name><surname>Karrer</surname> <given-names>B</given-names></string-name>, <string-name><surname>Newman</surname> <given-names>ME</given-names></string-name> (<year>2011</year>). <article-title>Stochastic blockmodels and community structure in networks</article-title>. <source>Physical Review E</source>, <volume>83</volume>(<issue>1</issue>): <fpage>016107</fpage>.</mixed-citation>
</ref>
<ref id="j_jds1028_ref_014">
<mixed-citation publication-type="journal"> <string-name><surname>Lock</surname> <given-names>EF</given-names></string-name>, <string-name><surname>Dunson</surname> <given-names>DB</given-names></string-name> (<year>2013</year>). <article-title>Bayesian consensus clustering</article-title>. <source>Bioinformatics</source>, <volume>29</volume>(<issue>20</issue>): <fpage>2610</fpage>–<lpage>2616</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1028_ref_015">
<mixed-citation publication-type="journal"> <string-name><surname>Lock</surname> <given-names>EF</given-names></string-name>, <string-name><surname>Hoadley</surname> <given-names>KA</given-names></string-name>, <string-name><surname>Marron</surname> <given-names>JS</given-names></string-name>, <string-name><surname>Nobel</surname> <given-names>AB</given-names></string-name> (<year>2013</year>). <article-title>Joint and individual variation explained (JIVE) for integrated analysis of multiple data types</article-title>. <source>The Annals of Applied Statistics</source>, <volume>7</volume>(<issue>1</issue>): <fpage>523</fpage>–<lpage>542</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1028_ref_016">
<mixed-citation publication-type="journal"> <string-name><surname>Mo</surname> <given-names>Q</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>S</given-names></string-name>, <string-name><surname>Seshan</surname> <given-names>VE</given-names></string-name>, <string-name><surname>Olshen</surname> <given-names>AB</given-names></string-name>, <string-name><surname>Schultz</surname> <given-names>N</given-names></string-name>, <string-name><surname>Sander</surname> <given-names>C</given-names></string-name>, <etal>et al.</etal> (<year>2013</year>). <article-title>Pattern discovery and cancer gene identification in integrated cancer genomic data</article-title>. <source>Proceedings of the National Academy of Sciences</source>, <volume>110</volume>(<issue>11</issue>): <fpage>4245</fpage>–<lpage>4250</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1028_ref_017">
<mixed-citation publication-type="journal"> <string-name><surname>Moliner</surname> <given-names>A</given-names></string-name>, <string-name><surname>Enfors</surname> <given-names>P</given-names></string-name>, <string-name><surname>Ibáñez</surname> <given-names>CF</given-names></string-name>, <string-name><surname>Andäng</surname> <given-names>M</given-names></string-name> (<year>2008</year>). <article-title>Mouse embryonic stem cell-derived spheres with distinct neurogenic potentials</article-title>. <source>Stem Cells and Development</source>, <volume>17</volume>(<issue>2</issue>): <fpage>233</fpage>–<lpage>243</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1028_ref_018">
<mixed-citation publication-type="journal"> <string-name><surname>Morris</surname> <given-names>CN</given-names></string-name>, <string-name><surname>Lysy</surname> <given-names>M</given-names></string-name> (<year>2012</year>). <article-title>Shrinkage estimation in multilevel normal models</article-title>. <source>Statistical Science</source>, <volume>27</volume>(<issue>1</issue>): <fpage>115</fpage>–<lpage>134</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1028_ref_019">
<mixed-citation publication-type="journal"> <string-name><surname>Newman</surname> <given-names>ME</given-names></string-name>, <string-name><surname>Clauset</surname> <given-names>A</given-names></string-name> (<year>2016</year>). <article-title>Structure and inference in annotated networks</article-title>. <source>Nature Communications</source>, <volume>7</volume>(<issue>1</issue>): <fpage>1</fpage>–<lpage>11</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1028_ref_020">
<mixed-citation publication-type="journal"> <string-name><surname>Nguyen</surname> <given-names>T</given-names></string-name>, <string-name><surname>Tagett</surname> <given-names>R</given-names></string-name>, <string-name><surname>Diaz</surname> <given-names>D</given-names></string-name>, <string-name><surname>Draghici</surname> <given-names>S</given-names></string-name> (<year>2017</year>). <article-title>A novel approach for data integration and disease subtyping</article-title>. <source>Genome Research</source>, <volume>27</volume>(<issue>12</issue>): <fpage>2025</fpage>–<lpage>2039</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1028_ref_021">
<mixed-citation publication-type="journal"> <string-name><surname>Perkins</surname> <given-names>AD</given-names></string-name>, <string-name><surname>Langston</surname> <given-names>MA</given-names></string-name> (<year>2009</year>). <article-title>Threshold selection in gene co-expression networks using spectral graph theory techniques</article-title>. <source>BMC Bioinformatics</source>, <volume>10</volume>: <fpage>1</fpage>–<lpage>11</lpage>. </mixed-citation>
</ref>
<ref id="j_jds1028_ref_022">
<mixed-citation publication-type="journal"> <string-name><surname>Priness</surname> <given-names>I</given-names></string-name>, <string-name><surname>Maimon</surname> <given-names>O</given-names></string-name>, <string-name><surname>Ben-Gal</surname> <given-names>I</given-names></string-name> (<year>2007</year>). <article-title>Evaluation of gene-expression clustering via mutual information distance measure</article-title>. <source>BMC Bioinformatics</source>, <volume>8</volume>(<issue>1</issue>): <fpage>1</fpage>–<lpage>12</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1028_ref_023">
<mixed-citation publication-type="journal"> <string-name><surname>Rand</surname> <given-names>WM</given-names></string-name> (<year>1971</year>). <article-title>Objective criteria for the evaluation of clustering methods</article-title>. <source>Journal of the American Statistical Association</source>, <volume>66</volume>(<issue>336</issue>): <fpage>846</fpage>–<lpage>850</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1028_ref_024">
<mixed-citation publication-type="journal"> <string-name><surname>Rappoport</surname> <given-names>N</given-names></string-name>, <string-name><surname>Shamir</surname> <given-names>R</given-names></string-name> (<year>2018</year>). <article-title>Multi-omic and multi-view clustering algorithms: Review and cancer benchmark</article-title>. <source>Nucleic acids research</source>, <volume>46</volume>(<issue>20</issue>): <fpage>10546</fpage>–<lpage>10562</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1028_ref_025">
<mixed-citation publication-type="journal"> <string-name><surname>Saldana</surname> <given-names>DF</given-names></string-name>, <string-name><surname>Yu</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Feng</surname> <given-names>Y</given-names></string-name> (<year>2017</year>). <article-title>How many communities are there?</article-title> <source>Journal of Computational and Graphical Statistics</source>, <volume>26</volume>(<issue>1</issue>): <fpage>171</fpage>–<lpage>181</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1028_ref_026">
<mixed-citation publication-type="journal"> <string-name><surname>Thompson</surname> <given-names>A</given-names></string-name>, <string-name><surname>May</surname> <given-names>MR</given-names></string-name>, <string-name><surname>Moore</surname> <given-names>BR</given-names></string-name>, <string-name><surname>Kopp</surname> <given-names>A</given-names></string-name> (<year>2020</year>). <article-title>A hierarchical bayesian mixture model for inferring the expression state of genes in transcriptomes</article-title>. <source>Proceedings of the National Academy of Sciences</source>, <volume>117</volume>(<issue>32</issue>): <fpage>19339</fpage>–<lpage>19346</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1028_ref_027">
<mixed-citation publication-type="journal"> <string-name><surname>Wang</surname> <given-names>B</given-names></string-name>, <string-name><surname>Mezlini</surname> <given-names>AM</given-names></string-name>, <string-name><surname>Demir</surname> <given-names>F</given-names></string-name>, <string-name><surname>Fiume</surname> <given-names>M</given-names></string-name>, <string-name><surname>Tu</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Brudno</surname> <given-names>M</given-names></string-name>, <etal>et al.</etal> (<year>2014</year>). <article-title>Similarity network fusion for aggregating data types on a genomic scale</article-title>. <source>Nature Methods</source>, <volume>11</volume>(<issue>3</issue>): <fpage>333</fpage>–<lpage>337</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1028_ref_028">
<mixed-citation publication-type="chapter"> <string-name><surname>Wang</surname> <given-names>H</given-names></string-name>, <string-name><surname>Nie</surname> <given-names>F</given-names></string-name>, <string-name><surname>Huang</surname> <given-names>H</given-names></string-name> (<year>2013</year>). <chapter-title>Multi-view clustering and feature learning via structured sparsity</chapter-title>. In: <string-name><surname>Dasgupta</surname> <given-names>S</given-names></string-name>, <string-name><surname>McAllester</surname> <given-names>D</given-names></string-name> (eds.) <source>International Conference on Machine Learning</source>, <fpage>352</fpage>–<lpage>360</lpage>. </mixed-citation>
</ref>
<ref id="j_jds1028_ref_029">
<mixed-citation publication-type="journal"> <string-name><surname>Wang</surname> <given-names>T</given-names></string-name>, <string-name><surname>Li</surname> <given-names>B</given-names></string-name>, <string-name><surname>Nelson</surname> <given-names>CE</given-names></string-name>, <string-name><surname>Nabavi</surname> <given-names>S</given-names></string-name> (<year>2019</year>). <article-title>Comparative analysis of differential gene expression analysis tools for single-cell RNA sequencing data</article-title>. <source>BMC Bioinformatics</source>, <volume>20</volume>(<issue>1</issue>): <fpage>1</fpage>–<lpage>16</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1028_ref_030">
<mixed-citation publication-type="journal"> <string-name><surname>Wu</surname> <given-names>D</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>D</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>MQ</given-names></string-name>, <string-name><surname>Gu</surname> <given-names>J</given-names></string-name> (<year>2015</year>). <article-title>Fast dimension reduction and integrative clustering of multi-omics data using low-rank approximation: Application to cancer molecular classification</article-title>. <source>BMC Genomics</source>, <volume>16</volume>(<issue>1</issue>): <fpage>1022</fpage>.</mixed-citation>
</ref>
<ref id="j_jds1028_ref_031">
<mixed-citation publication-type="chapter"> <string-name><surname>Xu</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Ke</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Cheng</surname> <given-names>H</given-names></string-name>, <string-name><surname>Cheng</surname> <given-names>J</given-names></string-name> (<year>2012</year>). <chapter-title>A model-based approach to attributed graph clustering</chapter-title>. In: <string-name><surname>Dasgupta</surname> <given-names>S</given-names></string-name>, <string-name><surname>McAllester</surname> <given-names>D</given-names></string-name> (eds.) <source>Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data</source>, <fpage>505</fpage>–<lpage>516</lpage>. </mixed-citation>
</ref>
<ref id="j_jds1028_ref_032">
<mixed-citation publication-type="journal"> <string-name><surname>Yan</surname> <given-names>B</given-names></string-name>, <string-name><surname>Sarkar</surname> <given-names>P</given-names></string-name> (<year>2021</year>). <article-title>Covariate regularized community detection in sparse graphs</article-title>. <source>Journal of the American Statistical Association</source>, <volume>116</volume>(<issue>534</issue>): <fpage>734</fpage>–<lpage>745</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1028_ref_033">
<mixed-citation publication-type="journal"> <string-name><surname>Zhang</surname> <given-names>B</given-names></string-name>, <string-name><surname>Horvath</surname> <given-names>S</given-names></string-name> (<year>2005</year>). <article-title>A general framework for weighted gene co-expression network analysis</article-title>. <source>Statistical Applications in Genetics and Molecular Biology</source>, <volume>4</volume>(<issue>1</issue>): <elocation-id>17</elocation-id>.</mixed-citation>
</ref>
<ref id="j_jds1028_ref_034">
<mixed-citation publication-type="journal"> <string-name><surname>Zhang</surname> <given-names>S</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>CC</given-names></string-name>, <string-name><surname>Li</surname> <given-names>W</given-names></string-name>, <string-name><surname>Shen</surname> <given-names>H</given-names></string-name>, <string-name><surname>Laird</surname> <given-names>PW</given-names></string-name>, <string-name><surname>Zhou</surname> <given-names>XJ</given-names></string-name> (<year>2012</year>). <article-title>Discovery of multi-dimensional modules by integrative analysis of cancer genomic data</article-title>. <source>Nucleic Acids Research</source>, <volume>40</volume>(<issue>19</issue>): <fpage>9379</fpage>–<lpage>9391</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1028_ref_035">
<mixed-citation publication-type="journal"> <string-name><surname>Zhao</surname> <given-names>Y</given-names></string-name> (<year>2017</year>). <article-title>A survey on theoretical advances of community detection in networks</article-title>. <source>Wiley Interdisciplinary Reviews: Computational Statistics</source>, <volume>9</volume>(<issue>5</issue>): <elocation-id>e1403</elocation-id>.</mixed-citation>
</ref>
<ref id="j_jds1028_ref_036">
<mixed-citation publication-type="journal"> <string-name><surname>Zhao</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Levina</surname> <given-names>E</given-names></string-name>, <string-name><surname>Zhu</surname> <given-names>J</given-names></string-name> (<year>2012</year>). <article-title>Consistency of community detection in networks under degree-corrected stochastic block models</article-title>. <source>The Annals of Statistics</source>, <volume>40</volume>(<issue>4</issue>): <fpage>2266</fpage>–<lpage>2292</lpage>.</mixed-citation>
</ref>
</ref-list>
</back>
</article>
