<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">JDS</journal-id>
<journal-title-group><journal-title>Journal of Data Science</journal-title></journal-title-group>
<issn pub-type="epub">1683-8602</issn><issn pub-type="ppub">1680-743X</issn><issn-l>1680-743X</issn-l>
<publisher>
<publisher-name>School of Statistics, Renmin University of China</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">JDS1012</article-id>
<article-id pub-id-type="doi">10.6339/21-JDS1012</article-id>
<article-categories><subj-group subj-group-type="heading">
<subject>Data Science in Action</subject></subj-group></article-categories>
<title-group>
<article-title>Augmented Abstractive Summarization with Document-Level Semantic Graph</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Bi</surname><given-names>Qiwei</given-names></name><xref ref-type="aff" rid="j_jds1012_aff_001">1</xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Li</surname><given-names>Haoyuan</given-names></name><xref ref-type="aff" rid="j_jds1012_aff_002">2</xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Lu</surname><given-names>Kun</given-names></name><xref ref-type="aff" rid="j_jds1012_aff_003">3</xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Yang</surname><given-names>Hanfang</given-names></name><email xlink:href="mailto:hyang@ruc.edu.cn">hyang@ruc.edu.cn</email><xref ref-type="aff" rid="j_jds1012_aff_001">1</xref><xref ref-type="aff" rid="j_jds1012_aff_004">4</xref><xref ref-type="corresp" rid="cor1">∗</xref>
</contrib>
<aff id="j_jds1012_aff_001"><label>1</label>School of Statistics, <institution>Renmin University of China</institution>, Beijing, <country>China</country></aff>
<aff id="j_jds1012_aff_002"><label>2</label>T.H. Chan School of Public Health, <institution>Harvard University</institution>, Boston, MA, <country>USA</country></aff>
<aff id="j_jds1012_aff_003"><label>3</label>Department of ORFE, <institution>Princeton University</institution>, Princeton, New Jersey, <country>USA</country></aff>
<aff id="j_jds1012_aff_004"><label>4</label>Center for Applied Statistics, School of Statistics, <institution>Renmin University of China</institution>, Beijing, <country>China</country></aff>
</contrib-group>
<author-notes>
<corresp id="cor1"><label>∗</label>Corresponding author. Email: <ext-link ext-link-type="uri" xlink:href="mailto:hyang@ruc.edu.cn">hyang@ruc.edu.cn</ext-link>.</corresp>
</author-notes>
<pub-date pub-type="ppub"><year>2021</year></pub-date><pub-date pub-type="epub"><day>4</day><month>5</month><year>2021</year></pub-date><volume>19</volume><issue>3</issue><fpage>450</fpage><lpage>464</lpage><supplementary-material id="S1" content-type="document" xlink:href="jds1012_s001.pdf" mimetype="application" mime-subtype="pdf">
<caption>
<title>Supplementary Material</title>
<p>Supplementary material online include: data link, python code and an instruction file needed to reproduce the results; an appendix containing additional structures and experiments we have tried. The web link is <uri>https://github.com/martin6336/DSGSum</uri>.</p>
</caption>
</supplementary-material><history><date date-type="received"><day>7</day><month>12</month><year>2020</year></date><date date-type="accepted"><day>10</day><month>4</month><year>2021</year></date></history>
<permissions><copyright-statement>2021 The Author(s). Published by the School of Statistics and the Center for Applied Statistics, Renmin University of China.</copyright-statement><copyright-year>2021</copyright-year>
<license license-type="open-access" xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>Open access article under the <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">CC BY</ext-link> license.</license-p></license></permissions>
<abstract>
<p>Previous abstractive methods apply sequence-to-sequence structures to generate summary without a module to assist the system to detect vital mentions and relationships within a document. To address this problem, we utilize semantic graph to boost the generation performance. Firstly, we extract important entities from each document and then establish a graph inspired by the idea of distant supervision (<xref ref-type="bibr" rid="j_jds1012_ref_020">Mintz et al.</xref>, <xref ref-type="bibr" rid="j_jds1012_ref_020">2009</xref>). Then, we combine a Bi-LSTM with a graph encoder to obtain the representation of each graph node. A novel neural decoder is presented to leverage the information of such entity graphs. Automatic and human evaluations show the effectiveness of our technique.</p>
</abstract>
<kwd-group>
<label>Keywords</label>
<kwd>distant supervise</kwd>
<kwd>entity extraction</kwd>
<kwd>graph attention neural network</kwd>
<kwd>information extraction</kwd>
</kwd-group>
</article-meta>
</front>
<back>
<ref-list id="j_jds1012_reflist_001">
<title>References</title>
<ref id="j_jds1012_ref_001">
<mixed-citation publication-type="other"> <string-name><surname>Bahdanau</surname> <given-names>D</given-names></string-name>, <string-name><surname>Cho</surname> <given-names>K</given-names></string-name>, <string-name><surname>Bengio</surname> <given-names>Y</given-names></string-name> (2014). Neural machine translation by jointly learning to align and translate. arXiv preprint: <uri>https://arxiv.org/abs/1409.0473</uri>.</mixed-citation>
</ref>
<ref id="j_jds1012_ref_002">
<mixed-citation publication-type="chapter"> <string-name><surname>Banarescu</surname> <given-names>L</given-names></string-name>, <string-name><surname>Bonial</surname> <given-names>C</given-names></string-name>, <string-name><surname>Cai</surname> <given-names>S</given-names></string-name>, <string-name><surname>Georgescu</surname> <given-names>M</given-names></string-name>, <string-name><surname>Griffitt</surname> <given-names>K</given-names></string-name>, <string-name><surname>Hermjakob</surname> <given-names>U</given-names></string-name>, <etal>et al.</etal> (<year>2013</year>). <chapter-title>Abstract Meaning Representation for sembanking</chapter-title>. In: <source>Proceedings of the 7th Linguistic Annotation Workshop and Interoperability with Discourse</source> (<string-name><given-names>S</given-names> <surname>Dipper</surname></string-name>, <string-name><given-names>M</given-names> <surname>Liakata</surname></string-name>, <string-name><given-names>A</given-names> <surname>Pareja-Lora</surname></string-name>, eds.), <fpage>178</fpage>–<lpage>186</lpage>. <publisher-name>Association for Computational Linguistics</publisher-name>, <publisher-loc>Sofia, Bulgaria</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1012_ref_003">
<mixed-citation publication-type="chapter"> <string-name><surname>Berg-Kirkpatrick</surname> <given-names>T</given-names></string-name>, <string-name><surname>Burkett</surname> <given-names>D</given-names></string-name>, <string-name><surname>Klein</surname> <given-names>D</given-names></string-name> (<year>2012</year>). <chapter-title>An empirical investigation of statistical significance in NLP</chapter-title>. In: <source>Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning</source> (<string-name><given-names>M</given-names> <surname>Pasca</surname></string-name>, <string-name><given-names>J</given-names> <surname>Henderson</surname></string-name>, <string-name><given-names>J</given-names> <surname>Tsujii</surname></string-name>, eds.), <fpage>995</fpage>–<lpage>1005</lpage>. <publisher-name>Association for Computational Linguistics</publisher-name>, <publisher-loc>Jeju Island, Korea</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1012_ref_004">
<mixed-citation publication-type="chapter"> <string-name><surname>Celikyilmaz</surname> <given-names>A</given-names></string-name>, <string-name><surname>Bosselut</surname> <given-names>A</given-names></string-name>, <string-name><surname>He</surname> <given-names>X</given-names></string-name>, <string-name><surname>Choi</surname> <given-names>Y</given-names></string-name> (<year>2018</year>). <chapter-title>Deep communicating agents for abstractive summarization</chapter-title>. In: <source>Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers)</source> (<string-name><given-names>A</given-names> <surname>Stent</surname></string-name>, <string-name><given-names>H</given-names> <surname>Ji</surname></string-name>, <string-name><given-names>MA</given-names> <surname>Walker</surname></string-name>, eds.), <fpage>1662</fpage>–<lpage>1675</lpage>. <publisher-name>Association for Computational Linguistics</publisher-name>, <publisher-loc>New Orleans, Louisiana</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1012_ref_005">
<mixed-citation publication-type="chapter"> <string-name><surname>Chen</surname> <given-names>YC</given-names></string-name>, <string-name><surname>Bansal</surname> <given-names>M</given-names></string-name> (<year>2018</year>). <chapter-title>Fast abstractive summarization with reinforce-selected sentence rewriting</chapter-title>. In: <source>Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)</source> (<string-name><given-names>Y</given-names> <surname>Miyao</surname></string-name>, <string-name><given-names>I</given-names> <surname>Gurevych</surname></string-name>, eds.), <fpage>675</fpage>–<lpage>686</lpage>. <publisher-name>Association for Computational Linguistics</publisher-name>, <publisher-loc>Melbourne, Australia</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1012_ref_006">
<mixed-citation publication-type="chapter"> <string-name><surname>Damonte</surname> <given-names>M</given-names></string-name>, <string-name><surname>Cohen</surname> <given-names>SB</given-names></string-name> (<year>2019</year>). <chapter-title>Structural neural encoders for AMR-to-text generation</chapter-title>. In: <source>Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers)</source> (<string-name><given-names>T</given-names> <surname>Solorio</surname></string-name>, <string-name><given-names>C</given-names> <surname>Doran</surname></string-name>, <string-name><given-names>J</given-names> <surname>Burstein</surname></string-name>, eds.), <fpage>3649</fpage>–<lpage>3658</lpage>. <publisher-name>Association for Computational Linguistics</publisher-name>, <publisher-loc>Minneapolis, Minnesota</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1012_ref_007">
<mixed-citation publication-type="chapter"> <string-name><surname>Durrett</surname> <given-names>G</given-names></string-name>, <string-name><surname>Berg-Kirkpatrick</surname> <given-names>T</given-names></string-name>, <string-name><surname>Klein</surname> <given-names>D</given-names></string-name> (<year>2016</year>). <chapter-title>Learning-based single-document summarization with compression and anaphoricity constraints</chapter-title>. In: <source>Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)</source>, <fpage>1998</fpage>–<lpage>2008</lpage>. <publisher-name>Association for Computational Linguistics</publisher-name>, <publisher-loc>Berlin, Germany</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1012_ref_008">
<mixed-citation publication-type="other"> <string-name><surname>Fan</surname> <given-names>L</given-names></string-name>, <string-name><surname>Yu</surname> <given-names>D</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>L</given-names></string-name> (2018). Robust neural abstractive summarization systems and evaluation against adversarial information. arXiv preprint: <uri>https://arxiv.org/abs/1810.06065</uri></mixed-citation>
</ref>
<ref id="j_jds1012_ref_009">
<mixed-citation publication-type="other"> <string-name><surname>Fernandes</surname> <given-names>P</given-names></string-name>, <string-name><surname>Allamanis</surname> <given-names>M</given-names></string-name>, <string-name><surname>Brockschmidt</surname> <given-names>M</given-names></string-name> (2019). Structured neural summarization.</mixed-citation>
</ref>
<ref id="j_jds1012_ref_010">
<mixed-citation publication-type="chapter"> <string-name><surname>Gehrmann</surname> <given-names>S</given-names></string-name>, <string-name><surname>Deng</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Rush</surname> <given-names>A</given-names></string-name> (<year>2018</year>). <chapter-title>Bottom-up abstractive summarization</chapter-title>. In: <source>Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing</source> (<string-name><given-names>J</given-names> <surname>Tsujii</surname></string-name>, <string-name><given-names>J</given-names> <surname>Hockenmaier</surname></string-name>, <string-name><given-names>D</given-names> <surname>Chiang</surname></string-name>, <string-name><given-names>E</given-names> <surname>Riloff</surname></string-name>, eds.), <fpage>4098</fpage>–<lpage>4109</lpage>. <publisher-name>Association for Computational Linguistics</publisher-name>, <publisher-loc>Brussels, Belgium</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1012_ref_011">
<mixed-citation publication-type="other"> <string-name><surname>Guu</surname> <given-names>K</given-names></string-name>, <string-name><surname>Lee</surname> <given-names>K</given-names></string-name>, <string-name><surname>Tung</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Pasupat</surname> <given-names>P</given-names></string-name>, <string-name><surname>Chang</surname> <given-names>MW</given-names></string-name> (2020). Realm: Retrieval-augmented language model pre-training. arXiv preprint: <uri>https://arxiv.org/abs/2002.08909</uri></mixed-citation>
</ref>
<ref id="j_jds1012_ref_012">
<mixed-citation publication-type="journal"> <string-name><surname>Hochreiter</surname> <given-names>S</given-names></string-name>, <string-name><surname>Schmidhuber</surname> <given-names>J</given-names></string-name> (<year>1997</year>). <article-title>Long short-term memory</article-title>. <source>Neural Comput.</source>, <volume>9</volume>(<issue>8</issue>): <fpage>1735</fpage>–<lpage>1780</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1012_ref_013">
<mixed-citation publication-type="other"> <string-name><surname>Honnibal</surname> <given-names>M</given-names></string-name>, <string-name><surname>Montani</surname> <given-names>I</given-names></string-name>, <string-name><surname>Van Landeghem</surname> <given-names>S</given-names></string-name>, <string-name><surname>Boyd</surname> <given-names>A</given-names></string-name> (2020). spaCy: Industrial-strength Natural Language Processing in Python. Zenodo, <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.5281/zenodo.1212303" xlink:type="simple">https://doi.org/10.5281/zenodo.1212303</ext-link></mixed-citation>
</ref>
<ref id="j_jds1012_ref_014">
<mixed-citation publication-type="other"> <string-name><surname>Koncel-Kedziorski</surname> <given-names>R</given-names></string-name>, <string-name><surname>Bekal</surname> <given-names>D</given-names></string-name>, <string-name><surname>Luan</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Lapata</surname> <given-names>M</given-names></string-name>, <string-name><surname>Hajishirzi</surname> <given-names>H</given-names></string-name> (2019). Text generation from knowledge graphs with graph transformers. arXiv preprint: <uri>https://arxiv.org/abs/1904.02342</uri></mixed-citation>
</ref>
<ref id="j_jds1012_ref_015">
<mixed-citation publication-type="chapter"> <string-name><surname>Lee</surname> <given-names>K</given-names></string-name>, <string-name><surname>He</surname> <given-names>L</given-names></string-name>, <string-name><surname>Lewis</surname> <given-names>M</given-names></string-name>, <string-name><surname>Zettlemoyer</surname> <given-names>L</given-names></string-name> (<year>2017</year>). <chapter-title>End-to-end neural coreference resolution</chapter-title>. In: <source>Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing</source> (<string-name><given-names>M</given-names> <surname>Palmer</surname></string-name>, <string-name><given-names>R</given-names> <surname>Hwa</surname></string-name>, <string-name><given-names>S</given-names> <surname>Riedel</surname></string-name>, eds.), <fpage>188</fpage>–<lpage>197</lpage>. <publisher-name>Association for Computational Linguistics</publisher-name>, <publisher-loc>Copenhagen, Denmark</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1012_ref_016">
<mixed-citation publication-type="chapter"> <string-name><surname>Liu</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Lapata</surname> <given-names>M</given-names></string-name> (<year>2019</year>). <chapter-title>Text summarization with pretrained encoders</chapter-title>. In: <source>Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)</source> (<string-name><given-names>K</given-names> <surname>Inui</surname></string-name>, <string-name><given-names>J</given-names> <surname>Jiang</surname></string-name>, <string-name><given-names>V</given-names> <surname>Ng</surname></string-name>, <string-name><given-names>X</given-names> <surname>Wan</surname></string-name>, eds.), <fpage>3730</fpage>–<lpage>3740</lpage>. <publisher-name>Association for Computational Linguistics</publisher-name>, <publisher-loc>Hong Kong, China</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1012_ref_017">
<mixed-citation publication-type="chapter"> <string-name><surname>Logan</surname> <given-names>R</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>NF</given-names></string-name>, <string-name><surname>Peters</surname> <given-names>ME</given-names></string-name>, <string-name><surname>Gardner</surname> <given-names>M</given-names></string-name>, <string-name><surname>Singh</surname> <given-names>S</given-names></string-name> (<year>2019</year>). <chapter-title>Barack’s wife Hillary: Using knowledge graphs for fact-aware language modeling</chapter-title>. In: <source>Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics</source> (<string-name><given-names>A</given-names> <surname>Korhonen</surname></string-name>, <string-name><given-names>DR</given-names> <surname>Traum</surname></string-name>, <string-name><given-names>L</given-names> <surname>Màrquez</surname></string-name>, eds.), <fpage>5962</fpage>–<lpage>5971</lpage>. <publisher-name>Association for Computational Linguistics</publisher-name>, <publisher-loc>Florence, Italy</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1012_ref_018">
<mixed-citation publication-type="chapter"> <string-name><surname>Luan</surname> <given-names>Y</given-names></string-name>, <string-name><surname>He</surname> <given-names>L</given-names></string-name>, <string-name><surname>Ostendorf</surname> <given-names>M</given-names></string-name>, <string-name><surname>Hajishirzi</surname> <given-names>H</given-names></string-name> (<year>2018</year>). <chapter-title>Multi-task identification of entities, relations, and coreference for scientific knowledge graph construction</chapter-title>. In: <source>Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing</source> (<string-name><given-names>E</given-names> <surname>Riloff</surname></string-name>, <string-name><given-names>D</given-names> <surname>Chiang</surname></string-name>, <string-name><given-names>J</given-names> <surname>Hockenmaier</surname></string-name>, <string-name><given-names>J</given-names> <surname>Tsujii</surname></string-name>, eds.), <fpage>3219</fpage>–<lpage>3232</lpage>. <publisher-name>Association for Computational Linguistics</publisher-name>, <publisher-loc>Brussels, Belgium</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1012_ref_019">
<mixed-citation publication-type="chapter"> <string-name><surname>Manning</surname> <given-names>C</given-names></string-name>, <string-name><surname>Surdeanu</surname> <given-names>M</given-names></string-name>, <string-name><surname>Bauer</surname> <given-names>J</given-names></string-name>, <string-name><surname>Finkel</surname> <given-names>J</given-names></string-name>, <string-name><surname>Bethard</surname> <given-names>S</given-names></string-name>, <string-name><surname>McClosky</surname> <given-names>D</given-names></string-name> (<year>2014</year>). <chapter-title>The Stanford CoreNLP natural language processing toolkit</chapter-title>. In: <source>Proceedings of 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations</source>, <fpage>55</fpage>–<lpage>60</lpage>. <publisher-name>Association for Computational Linguistics</publisher-name>, <publisher-loc>Baltimore, Maryland</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1012_ref_020">
<mixed-citation publication-type="chapter"> <string-name><surname>Mintz</surname> <given-names>M</given-names></string-name>, <string-name><surname>Bills</surname> <given-names>S</given-names></string-name>, <string-name><surname>Snow</surname> <given-names>R</given-names></string-name>, <string-name><surname>Jurafsky</surname> <given-names>D</given-names></string-name> (<year>2009</year>). <chapter-title>Distant supervision for relation extraction without labeled data</chapter-title>. In: <source>Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP</source> (<string-name><given-names>J</given-names> <surname>Wiebe</surname></string-name>, <string-name><given-names>J</given-names> <surname>Su</surname></string-name>, <string-name><given-names>K-Y</given-names> <surname>Su</surname></string-name>, eds.), <fpage>1003</fpage>–<lpage>1011</lpage>. <publisher-name>Association for Computational Linguistics</publisher-name>, <publisher-loc>Suntec, Singapore</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1012_ref_021">
<mixed-citation publication-type="chapter"> <string-name><surname>Nallapati</surname> <given-names>R</given-names></string-name>, <string-name><surname>Zhou</surname> <given-names>B</given-names></string-name>, <string-name><surname>dos Santos</surname> <given-names>C</given-names></string-name>, <string-name><surname>Gulçehre</surname> <given-names>Ç</given-names></string-name>, <string-name><surname>Xiang</surname> <given-names>B</given-names></string-name> (<year>2016</year>). <chapter-title>Abstractive text summarization using sequence-to-sequence RNNs and beyond</chapter-title>. In: <source>Proceedings of The 20th SIGNLL Conference on Computational Natural Language Learning</source> (<string-name><given-names>S</given-names> <surname>Riezler</surname></string-name>, <string-name><given-names>Y</given-names> <surname>Goldberg</surname></string-name>, eds.), <fpage>280</fpage>–<lpage>290</lpage>. <publisher-name>Association for Computational Linguistics</publisher-name>, <publisher-loc>Berlin, Germany</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1012_ref_022">
<mixed-citation publication-type="chapter"> <string-name><surname>Paulus</surname> <given-names>R</given-names></string-name>, <string-name><surname>Xiong</surname> <given-names>C</given-names></string-name>, <string-name><surname>Socher</surname> <given-names>R</given-names></string-name> (<year>2018</year>). <chapter-title>A deep reinforced model for abstractive summarization</chapter-title>. In: <source>International Conference on Learning Representations</source>.</mixed-citation>
</ref>
<ref id="j_jds1012_ref_023">
<mixed-citation publication-type="chapter"> <string-name><surname>Rush</surname> <given-names>AM</given-names></string-name>, <string-name><surname>Chopra</surname> <given-names>S</given-names></string-name>, <string-name><surname>Weston</surname> <given-names>J</given-names></string-name> (<year>2015</year>). <chapter-title>A neural attention model for abstractive sentence summarization</chapter-title>. In: <source>Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing</source> (<string-name><given-names>L</given-names> <surname>Màrquez</surname></string-name>, <string-name><given-names>C</given-names> <surname>Callison-Burch</surname></string-name>, <string-name><given-names>J</given-names> <surname>Su</surname></string-name>, <string-name><given-names>D</given-names> <surname>Pighin</surname></string-name>, <string-name><given-names>Y</given-names> <surname>Marton</surname></string-name>, eds.), <fpage>379</fpage>–<lpage>389</lpage>. <publisher-name>Association for Computational Linguistics</publisher-name>, <publisher-loc>Lisbon, Portugal</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1012_ref_024">
<mixed-citation publication-type="chapter"> <string-name><surname>See</surname> <given-names>A</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>PJ</given-names></string-name>, <string-name><surname>Manning</surname> <given-names>CD</given-names></string-name> (<year>2017</year>). <chapter-title>Get to the point: Summarization with pointer-generator networks</chapter-title>. In: <source>Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)</source> (<string-name><given-names>R</given-names> <surname>Barzilay</surname></string-name>, <string-name><given-names>M-Y</given-names> <surname>Kan</surname></string-name>, eds.), <fpage>1073</fpage>–<lpage>1083</lpage>. <publisher-name>Association for Computational Linguistics</publisher-name>, <publisher-loc>Vancouver, Canada</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1012_ref_025">
<mixed-citation publication-type="other"> <string-name><surname>Seo</surname> <given-names>M</given-names></string-name>, <string-name><surname>Kembhavi</surname> <given-names>A</given-names></string-name>, <string-name><surname>Farhadi</surname> <given-names>A</given-names></string-name>, <string-name><surname>Hajishirzi</surname> <given-names>H</given-names></string-name> (2017). Bidirectional attention flow for machine comprehension. arXiv preprint: <uri>https://arxiv.org/abs/1611.01603</uri></mixed-citation>
</ref>
<ref id="j_jds1012_ref_026">
<mixed-citation publication-type="chapter"> <string-name><surname>Sharma</surname> <given-names>E</given-names></string-name>, <string-name><surname>Huang</surname> <given-names>L</given-names></string-name>, <string-name><surname>Hu</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>L</given-names></string-name> (<year>2019</year>). <chapter-title>An entity-driven framework for abstractive summarization</chapter-title>. In: <source>Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)</source> (<string-name><given-names>K</given-names> <surname>Inui</surname></string-name>, <string-name><given-names>J</given-names> <surname>Jiang</surname></string-name>, <string-name><given-names>V</given-names> <surname>Ng</surname></string-name>, <string-name><given-names>X</given-names> <surname>Wan</surname></string-name>, eds.), <fpage>3280</fpage>–<lpage>3291</lpage>. <publisher-name>Association for Computational Linguistics</publisher-name>, <publisher-loc>Hong Kong, China</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1012_ref_027">
<mixed-citation publication-type="chapter"> <string-name><surname>Speer</surname> <given-names>R</given-names></string-name>, <string-name><surname>Havasi</surname> <given-names>C</given-names></string-name> (<year>2012</year>). <chapter-title>Representing general relational knowledge in ConceptNet 5</chapter-title>. In: <source>Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC’12)</source> (<string-name><given-names>N</given-names> <surname>Calzolari</surname></string-name>, <string-name><given-names>K</given-names> <surname>Choukri</surname></string-name>, <string-name><given-names>Declerck</given-names> <surname>T</surname></string-name>, <string-name><given-names>MU</given-names> <surname>Dogan</surname></string-name>, <string-name><given-names>B</given-names> <surname>Maegaard</surname></string-name>, <string-name><given-names>J</given-names> <surname>Mariani</surname></string-name>, <string-name><given-names>J</given-names> <surname>Odijk</surname></string-name>, <string-name><given-names>S</given-names> <surname>Piperidis</surname></string-name>,eds.), <fpage>3679</fpage>–<lpage>3686</lpage>. <publisher-name>European Language Resources Association (ELRA)</publisher-name>, <publisher-loc>Istanbul, Turkey</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1012_ref_028">
<mixed-citation publication-type="other"> <string-name><surname>Tay</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Bahri</surname> <given-names>D</given-names></string-name>, <string-name><surname>Metzler</surname> <given-names>D</given-names></string-name>, <string-name><surname>Juan</surname> <given-names>DC</given-names></string-name>, <string-name><surname>Zhao</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Zheng</surname> <given-names>C</given-names></string-name> (2020). Synthesizer: Rethinking self-attention in transformer models. arXiv preprint: <uri>https://arxiv.org/abs/2005.00743</uri></mixed-citation>
</ref>
<ref id="j_jds1012_ref_029">
<mixed-citation publication-type="chapter"> <string-name><surname>Trisedya</surname> <given-names>BD</given-names></string-name>, <string-name><surname>Weikum</surname> <given-names>G</given-names></string-name>, <string-name><surname>Qi</surname> <given-names>J</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>R</given-names></string-name> (<year>2019</year>). <chapter-title>Neural relation extraction for knowledge base enrichment</chapter-title>. In: <source>Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics</source> (<string-name><given-names>A</given-names> <surname>Korhonen</surname></string-name>, <string-name><given-names>DR</given-names> <surname>Traum</surname></string-name>, <string-name><given-names>L</given-names> <surname>Màrquez</surname></string-name>, eds.), <fpage>229</fpage>–<lpage>240</lpage>. <publisher-name>Association for Computational Linguistics</publisher-name>, <publisher-loc>Florence, Italy</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_jds1012_ref_030">
<mixed-citation publication-type="chapter"> <string-name><surname>Vaswani</surname> <given-names>A</given-names></string-name>, <string-name><surname>Shazeer</surname> <given-names>N</given-names></string-name>, <string-name><surname>Parmar</surname> <given-names>N</given-names></string-name>, <string-name><surname>Uszkoreit</surname> <given-names>J</given-names></string-name>, <string-name><surname>Jones</surname> <given-names>L</given-names></string-name>, <string-name><surname>Gomez</surname> <given-names>AN</given-names></string-name>, <etal>et al.</etal> (<year>2017</year>). <chapter-title>Attention is all you need</chapter-title>. In: <source>Advances in Neural Information Processing Systems</source> (<string-name><given-names>I</given-names> <surname>Guyon</surname></string-name>, <string-name><given-names>UV</given-names> <surname>Luxburg</surname></string-name>, <string-name><given-names>S</given-names> <surname>Bengio</surname></string-name>, <string-name><given-names>H</given-names> <surname>Wallach</surname></string-name>, <string-name><given-names>R</given-names> <surname>Fergus</surname></string-name>, <string-name><given-names>S</given-names> <surname>Vishwanathan</surname></string-name>, <string-name><given-names>R</given-names> <surname>Garnett</surname></string-name>, eds.), volume <volume>30</volume>. <publisher-name>Curran Associates, Inc.</publisher-name></mixed-citation>
</ref>
<ref id="j_jds1012_ref_031">
<mixed-citation publication-type="other"> <string-name><surname>Wolf</surname> <given-names>T</given-names></string-name>, <string-name><surname>Debut</surname> <given-names>L</given-names></string-name>, <string-name><surname>Sanh</surname> <given-names>V</given-names></string-name>, <string-name><surname>Chaumond</surname> <given-names>J</given-names></string-name>, <string-name><surname>Delangue</surname> <given-names>C</given-names></string-name>, <string-name><surname>Moi</surname> <given-names>A</given-names></string-name>, et al. (2019). Huggingface’s transformers: State-of-the-art natural language processing. arXiv preprint: <uri>https://arxiv.org/abs/1910.03771</uri></mixed-citation>
</ref>
<ref id="j_jds1012_ref_032">
<mixed-citation publication-type="chapter"> <string-name><surname>Zhang</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Qi</surname> <given-names>P</given-names></string-name>, <string-name><surname>Manning</surname> <given-names>CD</given-names></string-name> (<year>2018</year>). <chapter-title>Graph convolution over pruned dependency trees improves relation extraction</chapter-title>. In: <source>Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing</source> (<string-name><given-names>E</given-names> <surname>Riloff</surname></string-name>, <string-name><given-names>D</given-names> <surname>Chiang</surname></string-name>, <string-name><given-names>J</given-names> <surname>Hockenmaier</surname></string-name>, <string-name><given-names>J</given-names> <surname>Tsujii</surname></string-name>, eds.), <fpage>2205</fpage>–<lpage>2215</lpage>. <publisher-name>Association for Computational Linguistics</publisher-name>, <publisher-loc>Brussels, Belgium</publisher-loc>.</mixed-citation>
</ref>
</ref-list>
</back>
</article>
