Supplementary Material

JDS

Journal of Data Science

1683-86021680-743X

1680-743X

School of Statistics, Renmin University of China

JDS1012

10.6339/21-JDS1012

Data Science in Action

Augmented Abstractive Summarization with Document-Level Semantic Graph

Qiwei

1 Li

Haoyuan

2 Lu

Kun

3 Yang

Hanfang

hyang@ruc.edu.cn14∗ 1School of Statistics, Renmin University of China, Beijing, China 2T.H. Chan School of Public Health, Harvard University, Boston, MA, USA 3Department of ORFE, Princeton University, Princeton, New Jersey, USA 4Center for Applied Statistics, School of Statistics, Renmin University of China, Beijing, China

∗Corresponding author. Email: hyang@ruc.edu.cn.

2021

452021

193450464

Supplementary Material

Supplementary material online include: data link, python code and an instruction file needed to reproduce the results; an appendix containing additional structures and experiments we have tried. The web link is https://github.com/martin6336/DSGSum.

71220201042021

2021 The Author(s). Published by the School of Statistics and the Center for Applied Statistics, Renmin University of China.

2021

Open access article under the CC BY license.

Previous abstractive methods apply sequence-to-sequence structures to generate summary without a module to assist the system to detect vital mentions and relationships within a document. To address this problem, we utilize semantic graph to boost the generation performance. Firstly, we extract important entities from each document and then establish a graph inspired by the idea of distant supervision (Mintz et al., 2009). Then, we combine a Bi-LSTM with a graph encoder to obtain the representation of each graph node. A novel neural decoder is presented to leverage the information of such entity graphs. Automatic and human evaluations show the effectiveness of our technique.

Keywords distant supervise entity extraction graph attention neural network information extraction

References

Bahdanau

, Cho

, Bengio

(2014). Neural machine translation by jointly learning to align and translate. arXiv preprint: https://arxiv.org/abs/1409.0473.

Banarescu

, Bonial

, Cai

, Georgescu

, Griffitt

, Hermjakob

, et al. (2013). Abstract Meaning Representation for sembanking. In: Proceedings of the 7th Linguistic Annotation Workshop and Interoperability with Discourse (

Dipper,

Liakata,

Pareja-Lora, eds.), 178–186. Association for Computational Linguistics, Sofia, Bulgaria.

Berg-Kirkpatrick

, Burkett

, Klein

(2012). An empirical investigation of statistical significance in NLP. In: Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (

Pasca,

Henderson,

Tsujii, eds.), 995–1005. Association for Computational Linguistics, Jeju Island, Korea.

Celikyilmaz

, Bosselut

, He

, Choi

(2018). Deep communicating agents for abstractive summarization. In: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers) (

Stent,

Ji,

Walker, eds.), 1662–1675. Association for Computational Linguistics, New Orleans, Louisiana.

Chen

, Bansal

(2018). Fast abstractive summarization with reinforce-selected sentence rewriting. In: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (

Miyao,

Gurevych, eds.), 675–686. Association for Computational Linguistics, Melbourne, Australia.

Damonte

, Cohen

(2019). Structural neural encoders for AMR-to-text generation. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers) (

Solorio,

Doran,

Burstein, eds.), 3649–3658. Association for Computational Linguistics, Minneapolis, Minnesota.

Durrett

, Berg-Kirkpatrick

, Klein

(2016). Learning-based single-document summarization with compression and anaphoricity constraints. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 1998–2008. Association for Computational Linguistics, Berlin, Germany.

Fan

, Yu

, Wang

(2018). Robust neural abstractive summarization systems and evaluation against adversarial information. arXiv preprint: https://arxiv.org/abs/1810.06065

Fernandes

, Allamanis

, Brockschmidt

(2019). Structured neural summarization.

Gehrmann

, Deng

, Rush

(2018). Bottom-up abstractive summarization. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (

Tsujii,

Hockenmaier,

Chiang,

Riloff, eds.), 4098–4109. Association for Computational Linguistics, Brussels, Belgium.

Guu

, Lee

, Tung

, Pasupat

, Chang

(2020). Realm: Retrieval-augmented language model pre-training. arXiv preprint: https://arxiv.org/abs/2002.08909

Hochreiter

, Schmidhuber

(1997). Long short-term memory. Neural Comput., 9(8): 1735–1780.

Honnibal

, Montani

, Van Landeghem

, Boyd

(2020). spaCy: Industrial-strength Natural Language Processing in Python. Zenodo, https://doi.org/10.5281/zenodo.1212303

Koncel-Kedziorski

, Bekal

, Luan

, Lapata

, Hajishirzi

(2019). Text generation from knowledge graphs with graph transformers. arXiv preprint: https://arxiv.org/abs/1904.02342

Lee

, He

, Lewis

, Zettlemoyer

(2017). End-to-end neural coreference resolution. In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (

Palmer,

Hwa,

Riedel, eds.), 188–197. Association for Computational Linguistics, Copenhagen, Denmark.

Liu

, Lapata

(2019). Text summarization with pretrained encoders. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP) (

Inui,

Jiang,

Ng,

Wan, eds.), 3730–3740. Association for Computational Linguistics, Hong Kong, China.

Logan

, Liu

, Peters

, Gardner

, Singh

(2019). Barack’s wife Hillary: Using knowledge graphs for fact-aware language modeling. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (

Korhonen,

Traum,

Màrquez, eds.), 5962–5971. Association for Computational Linguistics, Florence, Italy.

Luan

, He

, Ostendorf

, Hajishirzi

(2018). Multi-task identification of entities, relations, and coreference for scientific knowledge graph construction. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (

Riloff,

Chiang,

Hockenmaier,

Tsujii, eds.), 3219–3232. Association for Computational Linguistics, Brussels, Belgium.

Manning

, Surdeanu

, Bauer

, Finkel

, Bethard

, McClosky

(2014). The Stanford CoreNLP natural language processing toolkit. In: Proceedings of 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 55–60. Association for Computational Linguistics, Baltimore, Maryland.

Mintz

, Bills

, Snow

, Jurafsky

(2009). Distant supervision for relation extraction without labeled data. In: Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP (

Wiebe,

Su,

K-Y

Su, eds.), 1003–1011. Association for Computational Linguistics, Suntec, Singapore.

Nallapati

, Zhou

, dos Santos

, Gulçehre

, Xiang

(2016). Abstractive text summarization using sequence-to-sequence RNNs and beyond. In: Proceedings of The 20th SIGNLL Conference on Computational Natural Language Learning (

Riezler,

Goldberg, eds.), 280–290. Association for Computational Linguistics, Berlin, Germany.

Paulus

, Xiong

, Socher

(2018). A deep reinforced model for abstractive summarization. In: International Conference on Learning Representations.

Rush

, Chopra

, Weston

(2015). A neural attention model for abstractive sentence summarization. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (

Màrquez,

Callison-Burch,

Su,

Pighin,

Marton, eds.), 379–389. Association for Computational Linguistics, Lisbon, Portugal.

See

, Liu

, Manning

(2017). Get to the point: Summarization with pointer-generator networks. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (

Barzilay,

M-Y

Kan, eds.), 1073–1083. Association for Computational Linguistics, Vancouver, Canada.

Seo

, Kembhavi

, Farhadi

, Hajishirzi

(2017). Bidirectional attention flow for machine comprehension. arXiv preprint: https://arxiv.org/abs/1611.01603

Sharma

, Huang

, Hu

, Wang

(2019). An entity-driven framework for abstractive summarization. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP) (

Inui,

Jiang,

Ng,

Wan, eds.), 3280–3291. Association for Computational Linguistics, Hong Kong, China.

Speer

, Havasi

(2012). Representing general relational knowledge in ConceptNet 5. In: Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC’12) (

Calzolari,

Choukri,

Declerck

Dogan,

Maegaard,

Mariani,

Odijk,

Piperidis,eds.), 3679–3686. European Language Resources Association (ELRA), Istanbul, Turkey.

Tay

, Bahri

, Metzler

, Juan

, Zhao

, Zheng

(2020). Synthesizer: Rethinking self-attention in transformer models. arXiv preprint: https://arxiv.org/abs/2005.00743

Trisedya

, Weikum

, Qi

, Zhang

(2019). Neural relation extraction for knowledge base enrichment. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (

Korhonen,

Traum,

Màrquez, eds.), 229–240. Association for Computational Linguistics, Florence, Italy.

Vaswani

, Shazeer

, Parmar

, Uszkoreit

, Jones

, Gomez

, et al. (2017). Attention is all you need. In: Advances in Neural Information Processing Systems (

Guyon,

Luxburg,

Bengio,

Wallach,

Fergus,

Vishwanathan,

Garnett, eds.), volume 30. Curran Associates, Inc.

Wolf

, Debut

, Sanh

, Chaumond

, Delangue

, Moi

, et al. (2019). Huggingface’s transformers: State-of-the-art natural language processing. arXiv preprint: https://arxiv.org/abs/1910.03771

Zhang

, Qi

, Manning

(2018). Graph convolution over pruned dependency trees improves relation extraction. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (

Riloff,

Chiang,

Hockenmaier,

Tsujii, eds.), 2205–2215. Association for Computational Linguistics, Brussels, Belgium.