<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
  <front>
    <journal-meta>
      <journal-id journal-id-type="publisher-id">JDS</journal-id>
      <journal-title-group>
        <journal-title>Journal of Data Science</journal-title>
      </journal-title-group>
      <issn pub-type="epub">1680-743X</issn>
      <issn pub-type="ppub">1680-743X</issn>
      <publisher>
        <publisher-name>SOSRUC</publisher-name>
      </publisher>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="publisher-id">S03</article-id>
      <article-id pub-id-type="doi">10.6339/JDS.202012_18(5).0003</article-id>
      <article-categories>
        <subj-group subj-group-type="heading">
          <subject>Research Article</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title>On Public Sentiment and Topic Mining during the COVID-19 Pandemic Based on Sina Weibo Comment Data</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <name>
            <surname>Du</surname>
            <given-names>Xiaomeng</given-names>
          </name>
          <xref ref-type="aff" rid="j_JDS_aff_000"/>
        </contrib>
        <aff id="j_JDS_aff_000">Data Science Lab, Percent Co., Beijing, China</aff>
        <contrib contrib-type="author">
          <name>
            <surname>Huang</surname>
            <given-names>Wei</given-names>
          </name>
          <xref ref-type="aff" rid="j_JDS_aff_001"/>
        </contrib>
        <aff id="j_JDS_aff_001">Data Science Lab, Percent Co., Beijing, China</aff>
        <contrib contrib-type="author">
          <name>
            <surname>Liu</surname>
            <given-names>Yijing</given-names>
          </name>
          <xref ref-type="aff" rid="j_JDS_aff_002"/>
        </contrib>
        <aff id="j_JDS_aff_002">Data Science Lab, Percent Co., Beijing, China</aff>
        <contrib contrib-type="author">
          <name>
            <surname>Su</surname>
            <given-names>Haibo</given-names>
          </name>
          <xref ref-type="aff" rid="j_JDS_aff_003"/>
        </contrib>
        <aff id="j_JDS_aff_003">Data Science Lab, Percent Co., Beijing, China</aff>
      </contrib-group>
      <volume>18</volume>
      <issue>5</issue>
      <fpage>875</fpage>
      <lpage>888</lpage>
      <permissions>
        <ali:free_to_read xmlns:ali="http://www.niso.org/schemas/ali/1.0/"/>
      </permissions>
      <abstract>
        <p>In the wake of the COVID-19 outbreak, the public resorted to Sina Weibo as a major platform for the trend of the pandemic. Research on public sentiment and topic mining of major public sentiment events based on Sina Weibo’s comment data is important for understanding the trend of public opinions during major epidemic outbreaks. Based on classification of the Chinese language into emotion categories in psychology, we use open source tools to build naive Bayesian models to classify Weibo comments. Visualization of comment topics is achieved with word co-occurrence network methods. Commented topics are mined with the help of the latent Dirichlet distribution model. The results show that the psychological sentiment classification combined with the naive Bayesian model can reflect the evolvement of public sentiment during the epidemic, and that the latent Dirichlet distribution model and word co-occurrence network can effectively mine the topics of public concerns.</p>
      </abstract>
      <kwd-group>
        <label>Keywords</label>
        <kwd>Dirichlet distribution</kwd>
        <kwd>emotional classification</kwd>
        <kwd>naive Bayes</kwd>
        <kwd>visualization</kwd>
        <kwd>word co-occurrence</kwd>
      </kwd-group>
    </article-meta>
  </front>
</article>
