Speech and Natural Language Processing > Natural Language Generation
Generation of text used in data-to-text, conversational agents, and narrative generation applications.
Contents
Datasets
A dataset for NLG in dialogue systems in the public transport information domain.
This dataset consists of (human-written) NBA basketball game summaries aligned with their corresponding box- and line-scores.
The repository contains the code along with the required corpora that were used in order to build a system that "learns" how to generate English biographies for Semantic Web triples.
The enriched version of the WebNLG - a resource for evaluating common NLG tasks, including Discourse Ordering, Lexicalization and Referring Expression Generation.
The Schema-Guided Dialogue (SGD) dataset consists of over 20k annotated multi-domain, task-oriented conversations between a human and a virtual assistant.
Dialog
Generate datasets for AI chatbots, NLP tasks, named entity recognition or text classification models using a simple DSL!
NNDial is an open source toolkit for building end-to-end trainable task-oriented dialogue models.
This is the Plato Research Dialogue System, a flexible platform for developing conversational AI agents.
RNNLG is an open source benchmark toolkit for Natural Language Generation (NLG) in spoken dialogue system application domains.
Statistical NLG for spoken dialogue systems.
Evaluation
Grammar
OpenCCG library for parsing and realization with CCG.
CCG: All combinators, common grammar format, parsing to logical form, parameter estimation for probabilistic CCG.
All combinators, common grammar format, parsing to logical form, parameter estimation for probabilistic CCG.
A Web platform for parsing and annotation.
Libraries
Narrative Generation
Neural Natural Language Generation
A robust Python tool for text-based AI training and generation using GPT-2.
Graph to sequence implemented in Pytorch combining Graph convolutional networks and opennmt-py.
A Neural Network based generative model for captioning images using Tensorflow.
A minimalistic codebase for finetuning and interacting with NLG models using PyTorch Lightning.
We present a PaperRobot who performs as an automatic research assistant.
Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.
Question generation is the task of automatically generating questions from a text paragraph.
Texar is a toolkit aiming to support a broad set of machine learning, especially natural language processing and text generation tasks.
Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code.
This is a project allows people to train a variant of GPT-2 that makes up words, definitions and examples from scratch.
State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
For converting information present in the form of structured data into natural language text.
Realizers
Surface realiser (part of a Natural Language Generation system) using Tree Adjoining Grammar.
A JavaScript bilingual text realizer for web development.
Java API for Natural Language Generation.
German version of SimpleNLG 4.
SimpleNLG-EnFr 1.1 is a bilingual English/French adaption of SimpleNLG v4.2.