A divide-and-conquer approach to neural natural language generation from structured data
Authors
Nina Dethlefs
Annika Schoene
Heriberto Cuayáhuitl
Abstract
Current approaches that generate text from linked data for complex real-world domains can face problems including rich and sparse vocabularies as well as learning from examples of long varied sequences. In this article, we propose a novel divide-and-conquer approach that automatically induces a hierarchy of “generation spaces” from a dataset of semantic concepts and texts. Generation spaces are based on a notion of similarity of partial knowledge graphs that represent the domain and feed into a hierarchy of sequence-to-sequence or memory-to-sequence learners for concept-to-text generation. An advantage of our approach is that learning models are exposed to the most relevant examples during training which can avoid bias towards majority samples. We evaluate our approach on two common benchmark datasets and compare our hierarchical approach against a flat learning setup. We also conduct a comparison between sequence-to-sequence and memory-to-sequence learning models. Experiments show that our hierarchical approach overcomes issues of data sparsity and learns robust lexico-syntactic patterns, consistently outperforming flat baselines and previous work by up to 30%. We also find that while memory-to-sequence models can outperform sequence-to-sequence models in some cases, the latter are generally more stable in their performance and represent a safer overall choice.
Citation
Dethlefs, N., Schoene, A., & Cuayáhuitl, H. (2021). A divide-and-conquer approach to neural natural language generation from structured data. Neurocomputing, 433, 300-309. https://doi.org/10.1016/j.neucom.2020.12.083
Journal Article Type | Article |
---|---|
Acceptance Date | Dec 14, 2020 |
Online Publication Date | Jan 5, 2021 |
Publication Date | Apr 14, 2021 |
Deposit Date | Feb 1, 2021 |
Publicly Available Date | Jan 6, 2022 |
Journal | Neurocomputing |
Print ISSN | 0925-2312 |
Publisher | Elsevier |
Peer Reviewed | Peer Reviewed |
Volume | 433 |
Pages | 300-309 |
DOI | https://doi.org/10.1016/j.neucom.2020.12.083 |
Keywords | Neural networks; Artificial intelligence; Natural language processing |
Public URL | https://hull-repository.worktribe.com/output/3709268 |
Publisher URL | https://www.sciencedirect.com/science/article/abs/pii/S0925231220319950?via%3Dihub |
Files
Article (2.1 Mb), PDF
Publisher Licence URL
https://creativecommons.org/licenses/by-nc-nd/4.0/
Copyright Statement
©2021. This manuscript version is made available under the CC-BY-NC-ND 4.0 license http://creativecommons.org/licenses/by-nc-nd/4.0/