<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
  <front>
    <journal-meta>
      <journal-id journal-id-type="publisher-id">JDS</journal-id>
      <journal-title-group>
        <journal-title>Journal of Data Science</journal-title>
      </journal-title-group>
      <issn pub-type="epub">1680-743X</issn>
      <issn pub-type="ppub">1680-743X</issn>
      <publisher>
        <publisher-name>SOSRUC</publisher-name>
      </publisher>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="publisher-id">150304</article-id>
	  <article-id pub-id-type="doi">10.6339/JDS.201707_15(3).0004</article-id>
      <article-categories>
        <subj-group subj-group-type="heading">
          <subject>Research Article</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title>Comparison of Cox Regression and Parametric Models for Survival Analysis of Genetic Variants in Hnf1b Gene Related to Age at Onset of Cancer</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <name>
            <surname>Wang</surname>
            <given-names>Kesheng</given-names>
          </name>
          <xref ref-type="aff" rid="j_JDS_aff_000"/>
        </contrib>
        <aff id="j_JDS_aff_000">Department of Biostatistics and Epidemiology, East Tennessee State University, Johnson City, TN 37614, USA</aff>
        <contrib contrib-type="author">
          <name>
            <surname>Liu</surname>
            <given-names>Xuefeng</given-names>
          </name>
          <xref ref-type="aff" rid="j_JDS_aff_001"/>
        </contrib>
        <aff id="j_JDS_aff_001">Department of Systems Leadership and Effectiveness Science, University of Michigan, Ann Arbor, MI 48109-5482, USA</aff>
        <contrib contrib-type="author">
          <name>
            <surname>Pan</surname>
            <given-names>Yue</given-names>
          </name>
          <xref ref-type="aff" rid="j_JDS_aff_002"/>
        </contrib>
        <aff id="j_JDS_aff_002">Department of Public Health Sciences, Miller School of Medicine, University of Miami, Miami, FL 33136, USA</aff>
        <contrib contrib-type="author">
          <name>
            <surname>Owusu</surname>
            <given-names>Daniel</given-names>
          </name>
          <xref ref-type="aff" rid="j_JDS_aff_003"/>
        </contrib>
        <aff id="j_JDS_aff_003">Department of Biostatistics and Epidemiology, East Tennessee State University, Johnson City, TN 37614, USA</aff>
        <contrib contrib-type="author">
          <name>
            <surname>Xu</surname>
            <given-names>Chun</given-names>
          </name>
          <xref ref-type="aff" rid="j_JDS_aff_004"/>
        </contrib>
        <aff id="j_JDS_aff_004">Department of Health and Biomedical Sciences, University of Texas Rio Grande Valley, Brownsville, TX 78520, USA</aff>
      </contrib-group>
      <volume>15</volume>
      <issue>3</issue>
      <fpage>423</fpage>
      <lpage>442</lpage>
      <permissions>
        <ali:free_to_read xmlns:ali="http://www.niso.org/schemas/ali/1.0/"/>
      </permissions>
      <abstract>
        <p>Semi-parametric Cox regression and parametric methods have been used to analyze survival data of cancer; however, no study has focused on the comparison of survival models in genetic association analysis of age at onset (AAO) of cancer. The Hepatocyte nuclear factor-1- beta (HNF1B) gene has been associated with risk of endometrial and prostate cancers; however, no study has focused on the effect of HNF1B gene on the AAO of cancer. This study examined 23 single nucleotide polymorphisms (SNPs) within the HNF1B gene in the Marshfield sample with 716 cancer cases and 2,848 non-cancer controls. Cox proportional hazards models in PROC PHREG and parametric survival models (including exponential, Weibull, log-normal, log-logistic, and gamma models) in PROC LIFEREG in SAS 9.4 were used to detect the genetic association of HNF1B gene with the AAO. The Akaike information criterion (AIC) and Bayesian information criterion (BIC) were used to compare the Cox models and parametric survival models. Both AIC and BIC values showed that the Weibull distribution is the best model for all the 23 SNPs and the Gamma distribution is the second best. The top two SNPs are rs4239217 and rs7501939 with time ratio (TR) =1.08 (p&lt;0.0001 for the AA and AG genotypes, respectively) and 1.07 (p=0.0004 and 0.0002 for CC and CT genotypes, respectively) based on the Weibull model, respectively. This study shows that the parametric Weibull distribution is the best model for the genetic association of AAO of cancer and provides the first evidence of several genetic variants within the HNF1B gene associated with AAO of cancer.</p>
      </abstract>
      <kwd-group>
        <label>Keywords</label>
        <kwd>Cancer</kwd>
        <kwd>age at onset</kwd>
        <kwd>HNF1B</kwd>
        <kwd>SNP</kwd>
        <kwd>survival analysis</kwd>
      </kwd-group>
    </article-meta>
  </front>
</article>
