<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">JDS</journal-id>
<journal-title-group><journal-title>Journal of Data Science</journal-title></journal-title-group>
<issn pub-type="epub">1683-8602</issn><issn pub-type="ppub">1680-743X</issn><issn-l>1680-743X</issn-l>
<publisher>
<publisher-name>School of Statistics, Renmin University of China</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">JDS1230</article-id>
<article-id pub-id-type="doi">10.6339/26-JDS1230</article-id>
<article-categories><subj-group subj-group-type="heading">
<subject>Statistical Data Science</subject></subj-group></article-categories>
<title-group>
<article-title>Leveraging Survey Metadata for LLM Reasoning via Knowledge Graphs</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<contrib-id contrib-id-type="orcid">https://orcid.org/0000-0003-2977-1936</contrib-id>
<name><surname>Belyaeva</surname><given-names>Irina</given-names></name><email xlink:href="mailto:irinabelaeva@gmail.com">irinabelaeva@gmail.com</email><email xlink:href="mailto:irina.belyaeva@census.gov">irina.belyaeva@census.gov</email><xref ref-type="aff" rid="j_jds1230_aff_001">1</xref><xref ref-type="corresp" rid="cor1">∗</xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Carino</surname><given-names>Christopher</given-names></name><xref ref-type="aff" rid="j_jds1230_aff_001">1</xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Wang</surname><given-names>Liang-Chi</given-names></name><xref ref-type="aff" rid="j_jds1230_aff_001">1</xref>
</contrib>
<aff id="j_jds1230_aff_001"><label>1</label><institution>Research and Methodology Directorate, Center for Enterprise Dissemination, U.S. Census Bureau</institution>, Suitland, Maryland 20746, <country>United States</country></aff>
</contrib-group>
<author-notes>
<corresp id="cor1"><label>∗</label>Corresponding author. Email: <ext-link ext-link-type="uri" xlink:href="mailto:irinabelaeva@gmail.com">irinabelaeva@gmail.com</ext-link> or <ext-link ext-link-type="uri" xlink:href="mailto:irina.belyaeva@census.gov">irina.belyaeva@census.gov</ext-link>.</corresp>
</author-notes>
<pub-date pub-type="ppub"><year>2026</year></pub-date><pub-date pub-type="epub"><day>21</day><month>5</month><year>2026</year></pub-date><volume content-type="ahead-of-print">0</volume><issue>0</issue><fpage>1</fpage><lpage>22</lpage><supplementary-material id="S1" content-type="archive" xlink:href="jds1230_s001.zip" mimetype="application" mime-subtype="x-zip-compressed">
<caption>
<title>Supplementary Material</title>
<p>Appendices A-C.</p>
</caption>
</supplementary-material><history><date date-type="received"><day>9</day><month>9</month><year>2025</year></date><date date-type="accepted"><day>9</day><month>4</month><year>2026</year></date></history>
<permissions><copyright-statement>2026 The Author(s). Published by the School of Statistics and the Center for Applied Statistics, Renmin University of China.</copyright-statement><copyright-year>2026</copyright-year>
<license license-type="open-access" xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>Open access article under the <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">CC BY</ext-link> license.</license-p></license></permissions>
<abstract>
<p>Statistical survey metadata contains essential contextual information that underpins the accurate interpretation, discovery, and reuse of statistical data. However, traditional metadata formats are not optimized for consumption by large language models (LLMs), which increasingly function as interfaces for data exploration, question-answering, and decision support. This work introduces a knowledge graph-based approach to modeling survey metadata using semantic web standards and linked data principles, specifically designed to make metadata machine-understandable and LLM-compatible. The core metadata entities, including surveys, datasets, variables, concepts, populations, and provenance, are modeled as rich interlinked nodes that allow reasoning, contextual enrichment, and structured prompting. The graph integrates established ontologies such as the Resource Description Framework (RDF) to promote interoperability and alignment with global standards. We demonstrate how this structure allows LLMs to surface relevant metadata, ground their outputs in authoritative sources, and generate semantically precise responses. This approach enhances transparency, facilitates metadata reuse, and supports the development of artificial intelligence (AI) applications powered by statistical products.</p>
</abstract>
<kwd-group>
<label>Keywords</label>
<kwd>large language models</kwd>
<kwd>linked data</kwd>
<kwd>link prediction</kwd>
<kwd>metadata interoperability</kwd>
<kwd>retrieval-augmented generation</kwd>
<kwd>semantic search</kwd>
<kwd>statistical knowledge graphs</kwd>
</kwd-group>
</article-meta>
</front>
<back>
<ref-list id="j_jds1230_reflist_001">
<title>References</title>
<ref id="j_jds1230_ref_001">
<mixed-citation publication-type="journal"> <string-name><surname>Abu-Salih</surname> <given-names>B</given-names></string-name> (<year>2021</year>). <article-title>Domain-specific knowledge graphs: A survey</article-title>. <source><italic>Journal of Network and Computer Applications</italic></source>, <volume>185</volume>: <elocation-id>103076</elocation-id>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1016/j.jnca.2021.103076" xlink:type="simple">https://doi.org/10.1016/j.jnca.2021.103076</ext-link></mixed-citation>
</ref>
<ref id="j_jds1230_ref_002">
<mixed-citation publication-type="chapter"> <string-name><surname>Bang</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Cahyawijaya</surname> <given-names>S</given-names></string-name>, <string-name><surname>Lee</surname> <given-names>N</given-names></string-name>, <string-name><surname>Dai</surname> <given-names>W</given-names></string-name>, <string-name><surname>Su</surname> <given-names>D</given-names></string-name>, ..., <string-name><surname>Fung</surname> <given-names>P</given-names></string-name> (<year>2023</year>). <chapter-title>A multitask, multilingual, multimodal evaluation of ChatGPT on reasoning, hallucination, and interactivity</chapter-title>. In: <source><italic>Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics (Volume 1: Long Papers)</italic></source> (<string-name><given-names>JC</given-names> <surname>Park</surname></string-name>, <string-name><given-names>Y</given-names> <surname>Arase</surname></string-name>, <string-name><given-names>B</given-names> <surname>Hu</surname></string-name>, <string-name><given-names>W</given-names> <surname>Lu</surname></string-name>, <string-name><given-names>D</given-names> <surname>Wijaya</surname></string-name>, <string-name><given-names>A</given-names> <surname>Purwarianti</surname></string-name>, <string-name><given-names>AA</given-names> <surname>Krisnadhi</surname></string-name>, eds.), <fpage>675</fpage>–<lpage>718</lpage>. <publisher-name>Association for Computational Linguistics, Nusa Dua</publisher-name>, <publisher-loc>Bali</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1230_ref_003">
<mixed-citation publication-type="journal"> <string-name><surname>Bennett</surname> <given-names>M</given-names></string-name> (<year>2013</year>). <article-title>The financial industry business ontology: Best practice for big data</article-title>. <source><italic>Journal of Banking Regulation</italic></source>, <volume>14</volume>(<issue>3</issue>): <fpage>255</fpage>–<lpage>268</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1057/jbr.2013.13" xlink:type="simple">https://doi.org/10.1057/jbr.2013.13</ext-link></mixed-citation>
</ref>
<ref id="j_jds1230_ref_004">
<mixed-citation publication-type="journal"> <string-name><surname>Bodenreider</surname> <given-names>O</given-names></string-name> (<year>2004</year>). <article-title>The unified medical language system (umls): Integrating biomedical terminology</article-title>. <source><italic>Nucleic acids research</italic></source>. <volume>32</volume>(<issue>suppl_1</issue>): <fpage>D267</fpage>–<lpage>D270</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1230_ref_005">
<mixed-citation publication-type="chapter"> <string-name><surname>Bouma</surname> <given-names>G</given-names></string-name> (<year>2009</year>). <chapter-title>Normalized (pointwise) mutual information in collocation extraction</chapter-title>. In: <source><italic>Proceedings of the Biennial GSCL Conference: From Form to Meaning—Processing Texts Automatically</italic></source> (<string-name><given-names>C</given-names> <surname>Chiarcos</surname></string-name>, <string-name><given-names>RE</given-names> <surname>de Castilho</surname></string-name>, <string-name><given-names>M</given-names> <surname>Stede</surname></string-name>, eds.), <fpage>31</fpage>–<lpage>40</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1230_ref_006">
<mixed-citation publication-type="journal"> <string-name><surname>Brown</surname> <given-names>T</given-names></string-name>, <string-name><surname>Mann</surname> <given-names>B</given-names></string-name>, <string-name><surname>Ryder</surname> <given-names>N</given-names></string-name>, <string-name><surname>Subbiah</surname> <given-names>M</given-names></string-name>, <string-name><surname>Kaplan</surname> <given-names>JD</given-names></string-name>, ..., <string-name><surname>Amodei</surname> <given-names>D</given-names></string-name> (<year>2020</year>). <article-title>Language models are few-shot learners</article-title>. <source><italic>Advances in Neural Information Processing Systems</italic></source>, <volume>33</volume>: <fpage>1877</fpage>–<lpage>1901</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1230_ref_007">
<mixed-citation publication-type="chapter"> <string-name><surname>Carlson</surname> <given-names>A</given-names></string-name>, <string-name><surname>Betteridge</surname> <given-names>J</given-names></string-name>, <string-name><surname>Kisiel</surname> <given-names>B</given-names></string-name>, <string-name><surname>Settles</surname> <given-names>B</given-names></string-name>, <string-name><surname>Hruschka</surname> <given-names>E</given-names></string-name>, <string-name><surname>Mitchell</surname> <given-names>T</given-names></string-name> (<year>2010</year>). <chapter-title>Toward an architecture for never-ending language learning</chapter-title>. In: <source><italic>Proceedings of the AAAI Conference on Artificial Intelligence</italic></source> (<string-name><given-names>M</given-names> <surname>Fox</surname></string-name>, <string-name><given-names>D</given-names> <surname>Poole</surname></string-name>, eds.), volume <volume>24</volume>, <fpage>1306</fpage>–<lpage>1313</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1230_ref_008">
<mixed-citation publication-type="chapter"> <string-name><surname>Christiano</surname> <given-names>PF</given-names></string-name>, <string-name><surname>Leike</surname> <given-names>J</given-names></string-name>, <string-name><surname>Brown</surname> <given-names>T</given-names></string-name>, <string-name><surname>Martic</surname> <given-names>M</given-names></string-name>, <string-name><surname>Legg</surname> <given-names>S</given-names></string-name>, <string-name><surname>Amodei</surname> <given-names>D</given-names></string-name> (<year>2017</year>). Deep reinforcement learning from human preferences. <italic>Advances in neural information processing systems</italic>, 30.</mixed-citation>
</ref>
<ref id="j_jds1230_ref_009">
<mixed-citation publication-type="other"> <string-name><surname>Cyganiak</surname> <given-names>R</given-names></string-name>, <string-name><surname>Wood</surname> <given-names>D</given-names></string-name>, <string-name><surname>Lanthaler</surname> <given-names>M</given-names></string-name> (<year>2014</year>). RDF 1.1 concepts and abstract syntax. <uri>https://www.w3.org/TR/2014/REC-rdf11-concepts-20140225/</uri>. W3C Recommendation. 25 February 2014.</mixed-citation>
</ref>
<ref id="j_jds1230_ref_010">
<mixed-citation publication-type="other"> <string-name><surname>Dai</surname> <given-names>D</given-names></string-name>, <string-name><surname>Dong</surname> <given-names>L</given-names></string-name>, <string-name><surname>Hao</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Sui</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Chang</surname> <given-names>B</given-names></string-name>, <string-name><surname>Wei</surname> <given-names>F</given-names></string-name> (<year>2021</year>). Knowledge neurons in pretrained transformers. arXiv preprint.</mixed-citation>
</ref>
<ref id="j_jds1230_ref_011">
<mixed-citation publication-type="other"> <string-name><surname>Devlin</surname> <given-names>J</given-names></string-name> (<year>2018</year>). Bert: Pre-training of deep bidirectional transformers for language understanding/arxiv preprint. arXiv preprint: arXiv:<ext-link ext-link-type="uri" xlink:href="https://arxiv.org/abs/1810.04805">1810.04805</ext-link></mixed-citation>
</ref>
<ref id="j_jds1230_ref_012">
<mixed-citation publication-type="chapter"> <string-name><surname>Golovneva</surname> <given-names>O</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>M</given-names></string-name>, <string-name><surname>Poff</surname> <given-names>S</given-names></string-name>, <string-name><surname>Corredor</surname> <given-names>M</given-names></string-name>, <string-name><surname>Zettlemoyer</surname> <given-names>L</given-names></string-name>, ..., <string-name><surname>Celikyilmaz</surname> <given-names>A</given-names></string-name> (<year>2023</year>). <chapter-title>ROSCOE: A suite of metrics for scoring step-by-step reasoning</chapter-title>. In: <source><italic>Proceedings of the Eleventh International Conference on Learning Representations (ICLR)</italic></source>.</mixed-citation>
</ref>
<ref id="j_jds1230_ref_013">
<mixed-citation publication-type="other"> <string-name><surname>Grattafiori</surname> <given-names>A</given-names></string-name>, <string-name><surname>Dubey</surname> <given-names>A</given-names></string-name>, <string-name><surname>Jauhri</surname> <given-names>A</given-names></string-name>, <string-name><surname>Pandey</surname> <given-names>A</given-names></string-name>, <string-name><surname>Kadian</surname> <given-names>A</given-names></string-name>, ..., <string-name><surname>Ma</surname> <given-names>Z</given-names></string-name> (<year>2024</year>). The llama 3 herd of models. arXiv preprint: arXiv:<ext-link ext-link-type="uri" xlink:href="https://arxiv.org/abs/2407.21783">2407.21783</ext-link></mixed-citation>
</ref>
<ref id="j_jds1230_ref_014">
<mixed-citation publication-type="other"> <string-name><surname>Grootendorst</surname> <given-names>M</given-names></string-name> (<year>2022</year>). Bertopic: Neural topic modeling with a class-based tf-idf procedure. arXiv preprint: arXiv:<ext-link ext-link-type="uri" xlink:href="https://arxiv.org/abs/2203.05794">2203.05794</ext-link></mixed-citation>
</ref>
<ref id="j_jds1230_ref_015">
<mixed-citation publication-type="journal"> <string-name><surname>Hastings</surname> <given-names>J</given-names></string-name>, <string-name><surname>Chepelev</surname> <given-names>L</given-names></string-name>, <string-name><surname>Willighagen</surname> <given-names>E</given-names></string-name>, <string-name><surname>Adams</surname> <given-names>N</given-names></string-name>, <string-name><surname>Steinbeck</surname> <given-names>C</given-names></string-name>, <string-name><surname>Dumontier</surname> <given-names>M</given-names></string-name> (<year>2011</year>). <article-title>The chemical information ontology: Provenance and disambiguation for chemical data on the biological semantic web</article-title>. <source><italic>PLoS ONE</italic></source>, <volume>6</volume>(<issue>10</issue>): <elocation-id>e25513</elocation-id>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1371/journal.pone.0025513" xlink:type="simple">https://doi.org/10.1371/journal.pone.0025513</ext-link></mixed-citation>
</ref>
<ref id="j_jds1230_ref_016">
<mixed-citation publication-type="journal"> <string-name><surname>Hu</surname> <given-names>N</given-names></string-name>, <string-name><surname>Wu</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Qi</surname> <given-names>G</given-names></string-name>, <string-name><surname>Min</surname> <given-names>D</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>J</given-names></string-name>, ..., <string-name><surname>Ali</surname> <given-names>Z</given-names></string-name> (<year>2023</year>). <article-title>An empirical study of pre-trained language models in simple knowledge graph question answering</article-title>. <source><italic>World Wide Web</italic></source>, <volume>26</volume>(<issue>5</issue>): <fpage>2855</fpage>–<lpage>2886</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1007/s11280-023-01166-y" xlink:type="simple">https://doi.org/10.1007/s11280-023-01166-y</ext-link></mixed-citation>
</ref>
<ref id="j_jds1230_ref_017">
<mixed-citation publication-type="other"> <string-name><surname>Hu</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Xu</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Yu</surname> <given-names>W</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>S</given-names></string-name>, <string-name><surname>Yang</surname> <given-names>Z</given-names></string-name>, ..., <string-name><surname>Sun</surname> <given-names>Y</given-names></string-name> (<year>2022</year>). Empowering language models with knowledge graph reasoning for question answering. arXiv preprint: arXiv:<ext-link ext-link-type="uri" xlink:href="https://arxiv.org/abs/2211.08380">2211.08380</ext-link></mixed-citation>
</ref>
<ref id="j_jds1230_ref_018">
<mixed-citation publication-type="other"> <string-name><surname>International Organization for Standardization</surname></string-name> (<year>2013</year>). Statistical data and metadata exchange (SDMX).</mixed-citation>
</ref>
<ref id="j_jds1230_ref_019">
<mixed-citation publication-type="journal"> <string-name><surname>Järvelin</surname> <given-names>K</given-names></string-name>, <string-name><surname>Kekäläinen</surname> <given-names>J</given-names></string-name> (<year>2002</year>). <article-title>Cumulated gain-based evaluation of ir techniques</article-title>. <source><italic>ACM Transactions on Information Systems</italic></source>, <volume>20</volume>(<issue>4</issue>): <fpage>422</fpage>–<lpage>446</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1145/582415.582418" xlink:type="simple">https://doi.org/10.1145/582415.582418</ext-link></mixed-citation>
</ref>
<ref id="j_jds1230_ref_020">
<mixed-citation publication-type="journal"> <string-name><surname>Ji</surname> <given-names>S</given-names></string-name>, <string-name><surname>Pan</surname> <given-names>S</given-names></string-name>, <string-name><surname>Cambria</surname> <given-names>E</given-names></string-name>, <string-name><surname>Marttinen</surname> <given-names>P</given-names></string-name>, <string-name><surname>Yu</surname> <given-names>PS</given-names></string-name> (<year>2021</year>). <article-title>A survey on knowledge graphs: Representation, acquisition, and applications</article-title>. <source><italic>IEEE Transactions on Neural Networks and Learning Systems</italic></source>, <volume>33</volume>(<issue>2</issue>): <fpage>494</fpage>–<lpage>514</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1109/TNNLS.2021.3070843" xlink:type="simple">https://doi.org/10.1109/TNNLS.2021.3070843</ext-link></mixed-citation>
</ref>
<ref id="j_jds1230_ref_021">
<mixed-citation publication-type="journal"> <string-name><surname>Ji</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Lee</surname> <given-names>N</given-names></string-name>, <string-name><surname>Frieske</surname> <given-names>R</given-names></string-name>, <string-name><surname>Yu</surname> <given-names>T</given-names></string-name>, <string-name><surname>Su</surname> <given-names>D</given-names></string-name>, ..., <string-name><surname>Fung</surname> <given-names>P</given-names></string-name> (<year>2023</year>). <article-title>Survey of hallucination in natural language generation</article-title>. <source><italic>ACM Computing Surveys</italic></source>, <volume>55</volume>(<issue>12</issue>): <fpage>1</fpage>–<lpage>38</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1145/3571730" xlink:type="simple">https://doi.org/10.1145/3571730</ext-link></mixed-citation>
</ref>
<ref id="j_jds1230_ref_022">
<mixed-citation publication-type="other"> <string-name><surname>Kevian</surname> <given-names>D</given-names></string-name>, <string-name><surname>Syed</surname> <given-names>U</given-names></string-name>, <string-name><surname>Guo</surname> <given-names>X</given-names></string-name>, <string-name><surname>Havens</surname> <given-names>A</given-names></string-name>, <string-name><surname>Dullerud</surname> <given-names>G</given-names></string-name>, ..., <string-name><surname>Hu</surname> <given-names>B</given-names></string-name> (<year>2024</year>). Capabilities of large language models in control engineering: A benchmark study on gpt-4, claude 3 opus, and gemini 1.0 ultra. arXiv preprint: arXiv:<ext-link ext-link-type="uri" xlink:href="https://arxiv.org/abs/2404.03647">2404.03647</ext-link></mixed-citation>
</ref>
<ref id="j_jds1230_ref_023">
<mixed-citation publication-type="chapter"> <string-name><surname>Lau</surname> <given-names>JH</given-names></string-name>, <string-name><surname>Newman</surname> <given-names>D</given-names></string-name>, <string-name><surname>Baldwin</surname> <given-names>T</given-names></string-name> (<year>2014</year>). <chapter-title>Machine reading tea leaves: Automatically evaluating topic coherence and topic model quality</chapter-title>. In: <source><italic>Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics</italic></source>, <fpage>530</fpage>–<lpage>539</lpage>. <publisher-name>Association for Computational Linguistics</publisher-name>, <publisher-loc>Gothenburg, Sweden</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1230_ref_024">
<mixed-citation publication-type="journal"> <string-name><surname>Lewis</surname> <given-names>P</given-names></string-name>, <string-name><surname>Perez</surname> <given-names>E</given-names></string-name>, <string-name><surname>Piktus</surname> <given-names>A</given-names></string-name>, <string-name><surname>Petroni</surname> <given-names>F</given-names></string-name>, <string-name><surname>Karpukhin</surname> <given-names>V</given-names></string-name>, ..., <string-name><surname>Kiela</surname> <given-names>D</given-names></string-name> (<year>2020</year>). <article-title>Retrieval-augmented generation for knowledge-intensive nlp tasks</article-title>. <source><italic>Advances in Neural Information Processing Systems</italic></source>, <volume>33</volume>: <fpage>9459</fpage>–<lpage>9474</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1230_ref_025">
<mixed-citation publication-type="other"> <string-name><surname>Li</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>C</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>H</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>S</given-names></string-name>, <string-name><surname>Gao</surname> <given-names>C</given-names></string-name> (<year>2022</year>). Cctest: Testing and repairing code completion systems. <italic>2023 ieee/acm 45th international conference on software engineering (icse) (2022)</italic>, 1238–1250.</mixed-citation>
</ref>
<ref id="j_jds1230_ref_026">
<mixed-citation publication-type="chapter"> <string-name><surname>Lin</surname> <given-names>BY</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>X</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>J</given-names></string-name>, <string-name><surname>Ren</surname> <given-names>X</given-names></string-name> (<year>2019</year>). <chapter-title>KagNet: Knowledge-aware graph networks for commonsense reasoning</chapter-title>. In: <source><italic>Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)</italic></source> (<string-name><given-names>K</given-names> <surname>Inui</surname></string-name>, <string-name><given-names>J</given-names> <surname>Jiang</surname></string-name>, <string-name><given-names>V</given-names> <surname>Ng</surname></string-name>, <string-name><given-names>X</given-names> <surname>Wan</surname></string-name>, eds.), <fpage>2829</fpage>–<lpage>2839</lpage>. <publisher-name>Association for Computational Linguistics</publisher-name>, <publisher-loc>Hong Kong, China</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1230_ref_027">
<mixed-citation publication-type="other"> <string-name><surname>Liu</surname> <given-names>J</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>C</given-names></string-name>, <string-name><surname>Zhou</surname> <given-names>P</given-names></string-name>, <string-name><surname>Lv</surname> <given-names>R</given-names></string-name>, <string-name><surname>Zhou</surname> <given-names>K</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>Y</given-names></string-name> (<year>2023</year>). Is chatgpt a good recommender? a preliminary study. arXiv preprint: arXiv:<ext-link ext-link-type="uri" xlink:href="https://arxiv.org/abs/2304.10149">2304.10149</ext-link></mixed-citation>
</ref>
<ref id="j_jds1230_ref_028">
<mixed-citation publication-type="chapter"> <string-name><surname>Liu</surname> <given-names>NF</given-names></string-name>, <string-name><surname>Gardner</surname> <given-names>M</given-names></string-name>, <string-name><surname>Belinkov</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Peters</surname> <given-names>ME</given-names></string-name>, <string-name><surname>Smith</surname> <given-names>NA</given-names></string-name> (<year>2019</year>). <chapter-title>Linguistic knowledge and transferability of contextual representations</chapter-title>. In: <source><italic>Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies</italic></source> (<string-name><given-names>J</given-names> <surname>Burstein</surname></string-name>, <string-name><given-names>C</given-names> <surname>Doran</surname></string-name>, <string-name><given-names>T</given-names> <surname>Solorio</surname></string-name>, eds.), volume <volume>1</volume> of <series><italic>Long and Short Papers</italic></series>, <fpage>1073</fpage>–<lpage>1094</lpage>. <publisher-name>Association for Computational Linguistics</publisher-name>, <publisher-loc>Minneapolis, Minnesota</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1230_ref_029">
<mixed-citation publication-type="chapter"> <string-name><surname>Liu</surname> <given-names>W</given-names></string-name>, <string-name><surname>Zhou</surname> <given-names>P</given-names></string-name>, <string-name><surname>Zhao</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Ju</surname> <given-names>Q</given-names></string-name>, ..., <string-name><surname>Wang</surname> <given-names>P</given-names></string-name> (<year>2020</year>). <chapter-title>K-bert: Enabling language representation with knowledge graph</chapter-title>. In: <source><italic>Proceedings of the AAAI Conference on Artificial Intelligence</italic></source>, volume <volume>34</volume>, <fpage>2901</fpage>–<lpage>2908</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1230_ref_030">
<mixed-citation publication-type="other"> <string-name><surname>Liu</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Ott</surname> <given-names>M</given-names></string-name>, <string-name><surname>Goyal</surname> <given-names>N</given-names></string-name>, <string-name><surname>Du</surname> <given-names>J</given-names></string-name>, <string-name><surname>Joshi</surname> <given-names>M</given-names></string-name>, ..., <string-name><surname>Stoyanov</surname> <given-names>V</given-names></string-name> (<year>2019</year>). Roberta: A robustly optimized bert pretraining approach. arXiv preprint: arXiv:<ext-link ext-link-type="uri" xlink:href="https://arxiv.org/abs/1907.11692">1907.11692</ext-link></mixed-citation>
</ref>
<ref id="j_jds1230_ref_031">
<mixed-citation publication-type="chapter"> <string-name><surname>Liu</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Wan</surname> <given-names>Y</given-names></string-name>, <string-name><surname>He</surname> <given-names>L</given-names></string-name>, <string-name><surname>Peng</surname> <given-names>H</given-names></string-name>, <string-name><surname>Yu</surname> <given-names>PS</given-names></string-name> (<year>2021</year>). <chapter-title>KG-bart: Knowledge graph-augmented bart for generative commonsense reasoning</chapter-title>. In: <source><italic>Proceedings of the AAAI Conference on Artificial Intelligence</italic></source>, volume <volume>35</volume>, <fpage>6418</fpage>–<lpage>6425</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1230_ref_032">
<mixed-citation publication-type="chapter"> <string-name><surname>Logan</surname> <given-names>R</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>NF</given-names></string-name>, <string-name><surname>Peters</surname> <given-names>ME</given-names></string-name>, <string-name><surname>Gardner</surname> <given-names>M</given-names></string-name>, <string-name><surname>Singh</surname> <given-names>S</given-names></string-name> (<year>2019</year>). <chapter-title>Barack’s wife hillary: Using knowledge graphs for fact-aware language modeling</chapter-title>. In: <source><italic>Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics</italic></source> (<string-name><given-names>A</given-names> <surname>Korhonen</surname></string-name>, <string-name><given-names>D</given-names> <surname>Traum</surname></string-name>, <string-name><given-names>L</given-names> <surname>Màrquez</surname></string-name>, eds.), <fpage>5962</fpage>–<lpage>5971</lpage>. <publisher-name>Association for Computational Linguistics</publisher-name>, <publisher-loc>Florence, Italy</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1230_ref_033">
<mixed-citation publication-type="chapter"> <string-name><surname>Luo</surname> <given-names>D</given-names></string-name>, <string-name><surname>Su</surname> <given-names>J</given-names></string-name>, <string-name><surname>Yu</surname> <given-names>S</given-names></string-name> (<year>2020</year>). <chapter-title>A bert-based approach with relation-aware attention for knowledge base question answering</chapter-title>. In: <source><italic>2020 International Joint Conference on Neural Networks (IJCNN)</italic></source>, <fpage>1</fpage>–<lpage>8</lpage>. <publisher-name>IEEE</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1230_ref_034">
<mixed-citation publication-type="chapter"> <string-name><surname>Malinka</surname> <given-names>K</given-names></string-name>, <string-name><surname>Peresíni</surname> <given-names>M</given-names></string-name>, <string-name><surname>Firc</surname> <given-names>A</given-names></string-name>, <string-name><surname>Hujnák</surname> <given-names>O</given-names></string-name>, <string-name><surname>Janus</surname> <given-names>F</given-names></string-name> (<year>2023</year>). <chapter-title>On the educational impact of chatgpt: Is artificial intelligence ready to obtain a university degree?</chapter-title> In: <source><italic>Proceedings of the 2023 Conference on Innovation and Technology in Computer Science Education v. 1</italic></source>, <fpage>47</fpage>–<lpage>53</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1230_ref_035">
<mixed-citation publication-type="book"> <string-name><surname>Manning</surname> <given-names>CD</given-names></string-name>, <string-name><surname>Raghavan</surname> <given-names>P</given-names></string-name>, <string-name><surname>Schütze</surname> <given-names>H</given-names></string-name> (<year>2008</year>). <source><italic>Introduction to Information Retrieval</italic></source>. <publisher-name>Cambridge University Press</publisher-name>, <publisher-loc>Cambridge</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1230_ref_036">
<mixed-citation publication-type="journal"> <string-name><surname>Mitchell</surname> <given-names>T</given-names></string-name>, <string-name><surname>Cohen</surname> <given-names>W</given-names></string-name>, <string-name><surname>Hruschka</surname> <given-names>E</given-names></string-name>, <string-name><surname>Talukdar</surname> <given-names>P</given-names></string-name>, <string-name><surname>Yang</surname> <given-names>B</given-names></string-name>, ..., <string-name><surname>Welling</surname> <given-names>J</given-names></string-name> (<year>2018</year>). <article-title>Never-ending learning</article-title>. <source><italic>Communications of the ACM</italic></source>, <volume>61</volume>(<issue>5</issue>): <fpage>103</fpage>–<lpage>115</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1145/3191513" xlink:type="simple">https://doi.org/10.1145/3191513</ext-link></mixed-citation>
</ref>
<ref id="j_jds1230_ref_037">
<mixed-citation publication-type="chapter"> <string-name><surname>Newman</surname> <given-names>D</given-names></string-name>, <string-name><surname>Lau</surname> <given-names>JH</given-names></string-name>, <string-name><surname>Grieser</surname> <given-names>K</given-names></string-name>, <string-name><surname>Baldwin</surname> <given-names>T</given-names></string-name> (<year>2010</year>). <chapter-title>Automatic evaluation of topic coherence</chapter-title>. In: <source><italic>Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics</italic></source>, <fpage>100</fpage>–<lpage>108</lpage>. <publisher-name>Association for Computational Linguistics</publisher-name>, <publisher-loc>Los Angeles, California</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1230_ref_038">
<mixed-citation publication-type="journal"> <string-name><surname>Ouyang</surname> <given-names>L</given-names></string-name>, <string-name><surname>Wu</surname> <given-names>J</given-names></string-name>, <string-name><surname>Jiang</surname> <given-names>X</given-names></string-name>, <string-name><surname>Almeida</surname> <given-names>D</given-names></string-name>, <string-name><surname>Wainwright</surname> <given-names>C</given-names></string-name>, ..., <string-name><surname>Lowe</surname> <given-names>R</given-names></string-name> (<year>2022</year>). <article-title>Training language models to follow instructions with human feedback</article-title>. <source><italic>Advances in Neural Information Processing Systems</italic></source>, <volume>35</volume>: <fpage>27730</fpage>–<lpage>27744</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1230_ref_039">
<mixed-citation publication-type="other"> <string-name><surname>Petroni</surname> <given-names>F</given-names></string-name>, <string-name><surname>Rocktäschel</surname> <given-names>T</given-names></string-name>, <string-name><surname>Lewis</surname> <given-names>P</given-names></string-name>, <string-name><surname>Bakhtin</surname> <given-names>A</given-names></string-name>, <string-name><surname>Wu</surname> <given-names>Y</given-names></string-name>, ..., <string-name><surname>Riedel</surname> <given-names>S</given-names></string-name> (<year>2019</year>). Language models as knowledge bases? arXiv preprint: arXiv:<ext-link ext-link-type="uri" xlink:href="https://arxiv.org/abs/1909.01066">1909.01066</ext-link></mixed-citation>
</ref>
<ref id="j_jds1230_ref_040">
<mixed-citation publication-type="journal"> <string-name><surname>Rafailov</surname> <given-names>R</given-names></string-name>, <string-name><surname>Sharma</surname> <given-names>A</given-names></string-name>, <string-name><surname>Mitchell</surname> <given-names>E</given-names></string-name>, <string-name><surname>Manning</surname> <given-names>CD</given-names></string-name>, <string-name><surname>Ermon</surname> <given-names>S</given-names></string-name>, <string-name><surname>Finn</surname> <given-names>C</given-names></string-name> (<year>2023</year>). <article-title>Direct preference optimization: Your language model is secretly a reward model</article-title>. <source><italic>Advances in Neural Information Processing Systems</italic></source>, <volume>36</volume>: <fpage>53728</fpage>–<lpage>53741</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.52202/075280-2338" xlink:type="simple">https://doi.org/10.52202/075280-2338</ext-link></mixed-citation>
</ref>
<ref id="j_jds1230_ref_041">
<mixed-citation publication-type="other"> <string-name><surname>Reimers</surname> <given-names>N</given-names></string-name>, <string-name><surname>Gurevych</surname> <given-names>I</given-names></string-name> (<year>2019</year>). Sentence-bert: Sentence embeddings using siamese bert-networks. arXiv preprint: arXiv:<ext-link ext-link-type="uri" xlink:href="https://arxiv.org/abs/1908.10084">1908.10084</ext-link></mixed-citation>
</ref>
<ref id="j_jds1230_ref_042">
<mixed-citation publication-type="journal"> <string-name><surname>Ristoski</surname> <given-names>P</given-names></string-name>, <string-name><surname>Rosati</surname> <given-names>J</given-names></string-name>, <string-name><surname>Di Noia</surname> <given-names>T</given-names></string-name>, <string-name><surname>De Leone</surname> <given-names>R</given-names></string-name>, <string-name><surname>Paulheim</surname> <given-names>H</given-names></string-name> (<year>2019</year>). <article-title>Rdf2vec: RDF graph embeddings and their applications</article-title>. <source><italic>Semantic Web</italic></source>, <volume>10</volume>(<issue>4</issue>): <fpage>721</fpage>–<lpage>752</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1230_ref_043">
<mixed-citation publication-type="journal"> <string-name><surname>Robertson</surname> <given-names>S</given-names></string-name>, <string-name><surname>Zaragoza</surname> <given-names>H</given-names></string-name> (<year>2009</year>). <article-title>The probabilistic relevance framework: BM25 and beyond</article-title>. <source><italic>Foundations and Trends in Information Retrieval</italic></source>, <volume>3</volume>(<issue>4</issue>): <fpage>333</fpage>–<lpage>389</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1230_ref_044">
<mixed-citation publication-type="chapter"> <string-name><surname>Röder</surname> <given-names>M</given-names></string-name>, <string-name><surname>Both</surname> <given-names>A</given-names></string-name>, <string-name><surname>Hinneburg</surname> <given-names>A</given-names></string-name> (<year>2015</year>). <chapter-title>Exploring the space of topic coherence measures</chapter-title>. In: <source><italic>Proceedings of the 8th ACM International Conference on Web Search and Data Mining (WSDM)</italic></source>, <fpage>399</fpage>–<lpage>408</lpage>. <publisher-name>ACM</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1230_ref_045">
<mixed-citation publication-type="other"> <string-name><surname>Sanh</surname> <given-names>V</given-names></string-name>, <string-name><surname>Webson</surname> <given-names>A</given-names></string-name>, <string-name><surname>Raffel</surname> <given-names>C</given-names></string-name>, <string-name><surname>Bach</surname> <given-names>SH</given-names></string-name>, <string-name><surname>Sutawika</surname> <given-names>L</given-names></string-name>, ..., <string-name><surname>Rush</surname> <given-names>AM</given-names></string-name> (<year>2021</year>). Multitask prompted training enables zero-shot task generalization. arXiv preprint: arXiv:<ext-link ext-link-type="uri" xlink:href="https://arxiv.org/abs/2110.08207">2110.08207</ext-link></mixed-citation>
</ref>
<ref id="j_jds1230_ref_046">
<mixed-citation publication-type="chapter"> <string-name><surname>Suchanek</surname> <given-names>FM</given-names></string-name>, <string-name><surname>Kasneci</surname> <given-names>G</given-names></string-name>, <string-name><surname>Weikum</surname> <given-names>G</given-names></string-name> (<year>2007</year>). <chapter-title>Yago: A core of semantic knowledge</chapter-title>. In: <source><italic>Proceedings of the 16th International Conference on World Wide Web</italic></source>, <fpage>697</fpage>–<lpage>706</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1230_ref_047">
<mixed-citation publication-type="other"> <string-name><surname>Team</surname> <given-names>G</given-names></string-name>, <string-name><surname>Mesnard</surname> <given-names>T</given-names></string-name>, <string-name><surname>Hardin</surname> <given-names>C</given-names></string-name>, <string-name><surname>Dadashi</surname> <given-names>R</given-names></string-name>, <string-name><surname>Bhupatiraju</surname> <given-names>S</given-names></string-name>, ..., <string-name><surname>Kenealy</surname> <given-names>K</given-names></string-name> (<year>2024</year>). Gemma: Open models based on Gemini Research and technology. arXiv preprint: arXiv:<ext-link ext-link-type="uri" xlink:href="https://arxiv.org/abs/2403.08295">2403.08295</ext-link></mixed-citation>
</ref>
<ref id="j_jds1230_ref_048">
<mixed-citation publication-type="other"> <string-name><surname>United Nations Economic Commission for Europe (UNECE)</surname></string-name> (<year>2025</year>). Generic statistical information model (GSIM) version 2.0: User guide. <uri>https://unece.org/</uri>. User Guide PDF. GSIM v2.0.</mixed-citation>
</ref>
<ref id="j_jds1230_ref_049">
<mixed-citation publication-type="other"> <string-name><surname>US Census Bureau</surname></string-name> (<year>2025</year>a). Census API user guide. <uri>https://www.census.gov/data/developers/guidance/api-user-guide.html</uri>. Published January 16, 2025. Accessed September 1, 2025.</mixed-citation>
</ref>
<ref id="j_jds1230_ref_050">
<mixed-citation publication-type="other"> <string-name><surname>US Census Bureau, American Community Survey</surname></string-name> (<year>2025</year>b). American community survey (ACS). <uri>https://www.census.gov/programs-surveys/acs.html</uri>. Accessed September 1, 2025.</mixed-citation>
</ref>
<ref id="j_jds1230_ref_051">
<mixed-citation publication-type="other"> <string-name><surname>US Census Bureau, American Community Survey 1-Year Estimates</surname></string-name> (<year>2023</year>). American community survey 1-year estimates. <uri>https://api.census.gov/data/2023/acs/acs1</uri>. Accessed September 1. 2025.</mixed-citation>
</ref>
<ref id="j_jds1230_ref_052">
<mixed-citation publication-type="other"> <string-name><surname>US Census Bureau, American Community Survey 5-Year Estimates</surname></string-name> (<year>2020</year>). American community survey 5-year estimates. <uri>https://api.census.gov/data/2020/acs/acs5</uri>. Accessed September 1. 2025.</mixed-citation>
</ref>
<ref id="j_jds1230_ref_053">
<mixed-citation publication-type="other"> <string-name><surname>Vaswani</surname> <given-names>A</given-names></string-name>, <string-name><surname>Shazeer</surname> <given-names>N</given-names></string-name>, <string-name><surname>Parmar</surname> <given-names>N</given-names></string-name>, <string-name><surname>Uszkoreit</surname> <given-names>J</given-names></string-name>, <string-name><surname>Jones</surname> <given-names>L</given-names></string-name>, ..., <string-name><surname>Polosukhin</surname> <given-names>I</given-names></string-name> (<year>2017</year>). Attention is all you need. Advances in neural information processing systems, 30.</mixed-citation>
</ref>
<ref id="j_jds1230_ref_054">
<mixed-citation publication-type="journal"> <string-name><surname>Vrandečić</surname> <given-names>D</given-names></string-name>, <string-name><surname>Krötzsch</surname> <given-names>M</given-names></string-name> (<year>2014</year>). <article-title>Wikidata: A free collaborative knowledgebase</article-title>. <source><italic>Communications of the ACM</italic></source>, <volume>57</volume>(<issue>10</issue>): <fpage>78</fpage>–<lpage>85</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1145/2629489" xlink:type="simple">https://doi.org/10.1145/2629489</ext-link></mixed-citation>
</ref>
<ref id="j_jds1230_ref_055">
<mixed-citation publication-type="other"> <string-name><surname>Wang</surname> <given-names>J</given-names></string-name>, <string-name><surname>Hu</surname> <given-names>X</given-names></string-name>, <string-name><surname>Hou</surname> <given-names>W</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>H</given-names></string-name>, <string-name><surname>Zheng</surname> <given-names>R</given-names></string-name>, ..., <string-name><surname>Xie</surname> <given-names>X</given-names></string-name> (<year>2023</year>a). On the robustness of chatgpt: An adversarial and out-of-distribution perspective. arXiv preprint: arXiv:<ext-link ext-link-type="uri" xlink:href="https://arxiv.org/abs/2302.12095">2302.12095</ext-link></mixed-citation>
</ref>
<ref id="j_jds1230_ref_056">
<mixed-citation publication-type="chapter"> <string-name><surname>Wang</surname> <given-names>X</given-names></string-name>, <string-name><surname>Wei</surname> <given-names>J</given-names></string-name>, <string-name><surname>Schuurmans</surname> <given-names>D</given-names></string-name>, <string-name><surname>Le</surname> <given-names>QV</given-names></string-name>, <string-name><surname>Chi</surname> <given-names>EH</given-names></string-name>, ..., <string-name><surname>Zhou</surname> <given-names>D</given-names></string-name> (<year>2023</year>b). <chapter-title>Self-consistency improves chain of thought reasoning in language models</chapter-title>. In: <source><italic>Proceedings of the Eleventh International Conference on Learning Representations (ICLR). ICLR</italic></source>. <comment>2023</comment>.</mixed-citation>
</ref>
<ref id="j_jds1230_ref_057">
<mixed-citation publication-type="other"> <string-name><surname>Wei</surname> <given-names>J</given-names></string-name>, <string-name><surname>Bosma</surname> <given-names>M</given-names></string-name>, <string-name><surname>Zhao</surname> <given-names>VY</given-names></string-name>, <string-name><surname>Guu</surname> <given-names>K</given-names></string-name>, <string-name><surname>Yu</surname> <given-names>AW</given-names></string-name>, ..., <string-name><surname>Le</surname> <given-names>QV</given-names></string-name> (<year>2021</year>). Finetuned language models are zero-shot learners. arXiv preprint: arXiv:<ext-link ext-link-type="uri" xlink:href="https://arxiv.org/abs/2109.01652">2109.01652</ext-link></mixed-citation>
</ref>
<ref id="j_jds1230_ref_058">
<mixed-citation publication-type="journal"> <string-name><surname>Yang</surname> <given-names>J</given-names></string-name>, <string-name><surname>Jin</surname> <given-names>H</given-names></string-name>, <string-name><surname>Tang</surname> <given-names>R</given-names></string-name>, <string-name><surname>Han</surname> <given-names>X</given-names></string-name>, <string-name><surname>Feng</surname> <given-names>Q</given-names></string-name>, ..., <string-name><surname>Hu</surname> <given-names>X</given-names></string-name> (<year>2024</year>). <article-title>Harnessing the power of llms in practice: A survey on chatgpt and beyond</article-title>. <source><italic>ACM Transactions on Knowledge Discovery from Data</italic></source>, <volume>18</volume>(<issue>6</issue>): <fpage>1</fpage>–<lpage>32</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1145/3649506" xlink:type="simple">https://doi.org/10.1145/3649506</ext-link></mixed-citation>
</ref>
<ref id="j_jds1230_ref_059">
<mixed-citation publication-type="chapter"> <string-name><surname>Zhang</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Han</surname> <given-names>X</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Jiang</surname> <given-names>X</given-names></string-name>, <string-name><surname>Sun</surname> <given-names>M</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>Q</given-names></string-name> (<year>2019</year>). <chapter-title>ERNIE: Enhanced language representation with informative entities</chapter-title>. In: <source><italic>Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics</italic></source> (<string-name><given-names>A</given-names> <surname>Korhonen</surname></string-name>, <string-name><given-names>D</given-names> <surname>Traum</surname></string-name>, <string-name><given-names>L</given-names> <surname>Màrquez</surname></string-name>, eds.), <fpage>1441</fpage>–<lpage>1451</lpage>. <publisher-name>Association for Computational Linguistics</publisher-name>, <publisher-loc>Florence, Italy</publisher-loc>.</mixed-citation>
</ref>
</ref-list>
</back>
</article>
