<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
  <front>
    <journal-meta>
      <journal-id journal-id-type="publisher-id">JDS</journal-id>
      <journal-title-group>
        <journal-title>Journal of Data Science</journal-title>
      </journal-title-group>
      <issn pub-type="epub">1680-743X</issn>
      <issn pub-type="ppub">1680-743X</issn>
      <publisher>
        <publisher-name>SOSRUC</publisher-name>
      </publisher>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="publisher-id">1704</article-id>
      <article-id pub-id-type="doi">10.6339/JDS.201901_17(1).0004</article-id>
      <article-categories>
        <subj-group subj-group-type="heading">
          <subject>Research Article</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title>An Ensemble Machine Learning Based System for Merchant Credit Risk Detection in Merchant MCC Misuse</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <name>
            <surname>Su</surname>
            <given-names>Chih-Hsiung</given-names>
          </name>
          <xref ref-type="aff" rid="j_JDS_aff_000"/>
        </contrib>
        <aff id="j_JDS_aff_000">Department of Accounting Information, Chihlee University of Technology, New Taipei City, Taiwan</aff>
        <contrib contrib-type="author">
          <name>
            <surname>Tu</surname>
            <given-names>Fengjun</given-names>
          </name>
          <xref ref-type="aff" rid="j_JDS_aff_001"/>
        </contrib>
        <aff id="j_JDS_aff_001">School of Business Administration, Guizhou University of Finance and Economics, Guiyang, China</aff>
        <contrib contrib-type="author">
          <name>
            <surname>Zhang</surname>
            <given-names>Xinyu</given-names>
          </name>
          <xref ref-type="aff" rid="j_JDS_aff_002"/>
        </contrib>
        <aff id="j_JDS_aff_002">Guiyang No. 1 High School, Guiyang, China</aff>
        <contrib contrib-type="author">
          <name>
            <surname>Shia</surname>
            <given-names>Ben-Chang</given-names>
          </name>
          <xref ref-type="aff" rid="j_JDS_aff_003"/>
        </contrib>
        <aff id="j_JDS_aff_003">College of Management, Taipei Medical University, Taipei, Taiwan</aff>
        <contrib contrib-type="author">
          <name>
            <surname>Lee</surname>
            <given-names>Tian-Shyug</given-names>
          </name>
          <xref ref-type="aff" rid="j_JDS_aff_004"/>
        </contrib>
        <aff id="j_JDS_aff_004">Graduate Institute of Business Administration, College of Management, Fu Jen Catholic University, New Taipei City, Taiwan</aff>
      </contrib-group>
      <volume>17</volume>
      <issue>1</issue>
      <fpage>81</fpage>
      <lpage>106</lpage>
      <permissions>
        <ali:free_to_read xmlns:ali="http://www.niso.org/schemas/ali/1.0/"/>
      </permissions>
      <abstract>
        <p>Although credit score models have been widely applied, one of their important variables, the Merchant Category Code (MCC), is sometimes misused, and such misuse may cause errors in credit scoring systems. The present study aimed to develop and deploy an MCC misuse detection system based on ensemble models, to give insights into the development process, and to compare different machine learning methods. XGBoost exhibited the best performance, with overall error, sensitivity, specificity, F<sub>1</sub> score, AUC and PRAUC of 0.1095, 0.7777, 0.9672, 0.8518, 0.9095 and 0.9090, respectively. These results indicate that MCC misuse by merchants can be predicted with satisfactory accuracy by our ensemble-based detection system. The paper thus not only shows that MCC misuse should not be overlooked but also helps researchers and practitioners apply ensemble machine learning based detection systems to this or similar problems.</p>
      </abstract>
      <kwd-group>
        <label>Keywords</label>
        <kwd>MCC misuse</kwd>
        <kwd>credit risk</kwd>
        <kwd>ensemble machine learning</kwd>
      </kwd-group>
    </article-meta>
  </front>
</article>
