<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
  <front>
    <journal-meta>
      <journal-id journal-id-type="publisher-id">JDS</journal-id>
      <journal-title-group>
        <journal-title>Journal of Data Science</journal-title>
      </journal-title-group>
      <issn pub-type="epub">1680-743X</issn>
      <issn pub-type="ppub">1680-743X</issn>
      <publisher>
        <publisher-name>SOSRUC</publisher-name>
      </publisher>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="publisher-id">2017_4-9</article-id>
	  <article-id pub-id-type="doi">10.6339/JDS.201704_15(2).0009</article-id>
      <article-categories>
        <subj-group subj-group-type="heading">
          <subject>Research Article</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title>General Semiparametric Area Under the Curve Regression Model with Discrete Covariates</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <name>
            <surname>Bohora</surname>
            <given-names>Som B.</given-names>
          </name>
          <xref ref-type="aff" rid="j_JDS_aff_000"/>
        </contrib>
        <aff id="j_JDS_aff_000">Department of Pediatrics, The University of Oklahoma Health Sciences, Oklahoma City, Oklahoma 73104, USA</aff>
        <contrib contrib-type="author">
          <name>
            <surname>Zhao</surname>
            <given-names>Yan D.</given-names>
          </name>
          <xref ref-type="aff" rid="j_JDS_aff_001"/>
        </contrib>
        <aff id="j_JDS_aff_001">Department of Biostatistics and Epidemiology, College of Public Health, The University of Oklahoma Health Sciences Center, Oklahoma City, OK 73104, USA</aff>
        <contrib contrib-type="author">
          <name>
            <surname>Balachova</surname>
            <given-names>Tatiana N.</given-names>
          </name>
          <xref ref-type="aff" rid="j_JDS_aff_002"/>
        </contrib>
        <aff id="j_JDS_aff_002">Department of Pediatrics, The University of Oklahoma Health Sciences, Oklahoma City, Oklahoma 73104, USA</aff>
      </contrib-group>
      <volume>15</volume>
      <issue>2</issue>
      <fpage>329</fpage>
      <lpage>344</lpage>
      <permissions>
        <ali:free_to_read xmlns:ali="http://www.niso.org/schemas/ali/1.0/"/>
      </permissions>
      <abstract>
        <p>In this article, we considered the analysis of data with a non-normally distributed response variable. In particular, we extended an existing Area Under the Curve (AUC) regression model that handles only two discrete covariates to a general AUC regression model that can be used to analyze data with unrestricted number of discrete covariates. Comparing with other similar methods which require iterative algorithms and bootstrap procedure, our method involved only closed-form formulae for parameter estimation. Additionally, we also discussed the issue of model identifiability. Our model has broad applicability in clinical trials due to the ease of interpretation on model parameters. We applied our model to analyze a clinical trial evaluating the effects of educational brochures for preventing Fetal Alcohol Spectrum Disorders (FASD). Finally, for a variety of simulation scenarios, our method produced parameter estimates with small biases and confidence intervals with nominal coverage probabilities.</p>
      </abstract>
      <kwd-group>
        <label>Keywords</label>
        <kwd>AUC</kwd>
        <kwd>clinical trial</kwd>
        <kwd>discrete covariates</kwd>
        <kwd>nonparametric</kwd>
        <kwd>semiparametric</kwd>
      </kwd-group>
    </article-meta>
  </front>
</article>
